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(54) IMAGE PROCESSOR AND ITS METHOD 
(57)Abstract: 

PROBLEM TO BE SOLVED: To print an arbitrary image as 

fast as possible by using a coprocessor for graphics which "~~ »*rfZ*ii'p.jz , 

cooperates with a host processor, and operating one of data >»- «.i ••^^*r^,»|<'^''»*» 

computation streams at any time and performing an image .': ^I^j^ f ti , 

process for an image data stream. j — 



SOLUTION: A host CPU sends a series of instruction and ^ ( A Ij;'™ r -CZ, : _s<* 

data through a PCI bus 206 and a raster image coprocessor $}\+n<**->-\ f*'l Lj 



224 generates an image. The sent data are stored not only in 5" — j 

a local memory, but also in the cache or register in the raster ^ j 

image coprocessor 224. As a speed-increased process, data i>J; r~;^\ j 
streams are processed mainly by a JPEAG encoder decoder 



241 and a main data path part 242. The processed streams < "" f j J_^^"t^ - ^V* * 
from the parts 241 and 242 are transferred to a result f W Jf.^' r^fj 

organizer 249. In this case, one of the data computation V£\ <J. *J:.jJ '^Lk**/ Ljj l! v,? J 



stream is operated at any time to perform an image process )*\\ i jw*rj j " c lIIL^" , ^ 1 !S * 3 
for the image data streams. L — 
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i^g^€ : y'<-5?-y-'fXK:&?¥-r*®*©9f^a©t*^ « 

FT<&»5, J&>^#j£t"5«ia^-:*rt©#7-fc>y FT* 

hmkh 4 ? ] we^-^s^-im xt& o , t&E 
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SH)f^<-^iJ--< 7 h uj. vjmiz* * y ©^n-tj-f 
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Sifr5«*iR4 7 lc!2«©gg. 

7h ; l^XV-ytf>y&^. &^ ; ey7 HUXV«>tf> 
^(iHute^X M t U ©gg^ t 'J ^-y 7 F ti 
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l t v^-s c: t t r -& m #m 4 4 tm comm. 
immm 5 0 ] iafassf ^rhec, 

^ 1 © t* y hSc©^^^ 'J ^-yWWt'J 7K1/ 
X£^2©£-y h»©^-i?T— 7*;^ >X5/^XC«H 

tTzmxms o icE«©gg„ 

^f-^OtH^-iJCiIlT H kX© 1 0*jM^-r 
Sfc»©^l©filf^f-y Hkt. ncawufc^-^ 
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tf >>f&M${-rz> TztboW. 2 ©H)r^»© b*-y h t *tSx. 
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y ^-S?7 F kX i , S9ES*S n&<Rg* * y 7 H V 
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So 
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^nztifc®* t u 7 h* tttfaTzwrnT \* i^x & 
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^n-feyy-C i 0»#$nfclK«/ * 'J 7 F UX 
t ffiE A' -v 7 7 U > ic KM $ nt n £ ;* * 'J 7 K 
UXV-y tr>^» 1 OOES/t'J 7 h* l^Xi^-Sf-r 
StS5l2tkfeXxyyTfJ^3n-5i. -SLfc^tU 10 

fflfEri yn-fe >> -y-ic «k t) ftfc&g* t 'J 7 H l/X 

t ftfSE A y 7 7 >J > iCSffifei^ 3 ft T V > 5 * * 'J 

fcf>yroviTn^o^Ji^ ; &»j7 KkXt 
a*-* u u v» t tatfittKx t- ? jta ft s t . me 

A-y7r>;> 7*^ ^- v>f— 7*;i^ 6 3l 9 ffl b & * 

* u 7 k i^x v >> tr >yTHff-rsasfxx^ 7t&% 
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Mfflfr 0 m hfcX =E 'J 7 H kXT y E? >7*#t&fEn 7°n 20 
[«^]R5 6] WE»teft©««»*-5>tt, ME* 

X M t 'J © 2 :3*fiWttBU©#a9*fc: 7 h ux aim 
1tW*&*ttz. i*WHtr*l««« 5 5 icgEft©;* 
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[lt*S5 7] MI2^ : tU7HU'X-ry tf>^©{£«! 
^ U 7 F UX tt<K?i^-i>#^-T?c& 0 „ 30 
ttflE* * U 7 F I'X V y tf yifomwr F I'X ttfflfE* 
X F ;* * U 0«5aWft^->>#^T;& 0 . 
*8Sg#S ftltm.* * U 7 H PX * U 

*Jft<D*7±.y F7 FP-Xfcil*., WSB:*7-fe-y H*# 

in $m 5 8 ] mfK-wmiz+M xn$> o , us 
^**y^->?#^£fia*tij^-^#^£tt, m 

fB0f^^-^iJ--1'X*i:7Kl^XnIgE&^ ;: E'J©^tl-y-'f 40 

tt, S/hK0#5'<-i?D-rX^S^< 7 H kX h 

•TZmOT FVXtfy FWnJgffl^iSStJZliSft 
afT5S*« 5 8 ICSa©*)*. 

tit^ 6 b 1 iWB^—^T-^MWfJfefciDM * «) 

f|t|l«t'XV5jtf>^^ ftM%U7l«^v^t:> 




l^-^7h'l/xm, B9fEHJr^S:©^ ; E'J7 Ml/X 

UTV»5Ct*1«»fr*l9*«5 5 tlE«©*ji. 
M*i6 1] SafEMffXx-y^ttSI^ 
mi©f-v htS:©ffig^ ; &U^-v'©{Kffl/ ; &U7 h'U 

^brsxT-^y^wL, 

[ft&S 6 2 ] MfEMfrXT- y 7°«gl' . 

^nommr Hpx*iais-r-5x^y7"s<f^.5c: 1 k* 
^x-7*^©#)a^->'©ig9a7 h*^x© 1 o*s^-r 

-7*;P©fja^-^rt©ft l) ffl * U 7 h* WX V -y 

tf > ^*a*;-r « fe* ©8 2 ©w itss:© tr v v t zmz. 

-Srt fc&a t -T -5 6 2 kl!E«©77 & 
[H^a 6 4] LRUn777*ifb TUfEA -y X 
rU>y^gt*W-5^ ; 6>J7 h*l^XV7tf>^©ffiffl 
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Sijfe®DfiB$nfe^ ; e'j7 KPXTy tf>^©{K«.^^ 
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wstfcKx^^yT-a-r^tw^^n-g.t. msa*-> 
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17 mm tf >^tifeitei7 Kt-x v a 
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a. 

[is*« 7 4 ] ffif2GQB«9ffi* t y t y tf > ymmt 
sic 

WE 3 7"n-fe ? tMc<k 0 tifc&&* * y S>7 

tJUBD^d-feyij-fc.ttis^snfciKfi^ty 7 fp-x 40 

tlWEaSJM'y ^7*^ P/ty 7r©**y'7 FWXg 
ft© 1 o©<K3g;**y 7 F l"X ttf-Srf 5 t ■?•©-& 

tfc** y 7 f ^xaft*^ffifiE3ya-fe7-y-i^a7 

MISa7*a-b-;D-|c«t tJS^snfciKfi^^y 7 F U-X 
tWE^ft^^-tH' PA^rfcS&ElgiStiT^ 
« ;* * y 7 F U-xgk© i o©<K3M * y 7 F w-x £*< 
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NEB 0 dJ Lfc^^E U 7 F UXjHfcjWWEri -Jo-ty* 

ciOB^snfcfiee/ty 7 Fi^xc*fjs-r*«a7 

-^ict2tg$nT^-5Jii:si#afrsif*ja7 31112 

[W*JS7 5] ffiEflrJfcgcWifelJIW^-v'W:. 1512* 

x m * y © 2 o*f4-tnei±©#*ttK7 f px^ 

&««ft*tf C 1 <t -f * M$]g 7 4 C Eft© ->X 
fA. 

[»MtJ*7 6] ME**y7 Fi/xfcswxKS^y 

7 F U-Xtt<K38K-y#^-e<fc 0 . 

ISIS^ty 7 Fl/X^&©%g7 F l/X(4i9f2*X F* 

* y oaiftft^-^T* 0 . 
MEB#;*ftfc<KS;* t'jTH i^xtt&fleg;* * y ^~ 

5?rt©^-7-try FSrtix., ffif2*7-fe-y Ftt&fiiSg**: 
y^-i^-'f Xfc4kfiFf*«*©Bf3£fc©ey FT* 

0. *»-3»isr*«»a^-s?rt©^-7-fe:y htr»«cit 

*^Sti:-rslS*«7 4IC|2«©->Xt-A. 
[M*« 7 7 ] WE"<-5>ttBrJfetK X-eafe 0 , ffiE 

^**y*-y»*fc*a**y*c-3?#*i:tt, w 

EUfJfe^-^U-'f 7 F kX BJtgfc* t y ©£IHK 

©H«-y--fXffi»t, sffi©#^©^->'-y-'rxic«-j£ 

•T'5aO©7FkXf-v h<D~OI&Wftt$:'£tSZ.t£ft 

m«9l 7 9 ] WE^-S't— ^;n*Br«*©^ * y 
TPtaxaute*, s-^ : &y7Fi^xa:fttt«9E*x 

h^ J Ey©«i^ : Ey^-3'7 Hl/Xfc*a-^-3?7 F 
W-X4#*. WEBf^Sc©/ ; Ey7FP'XSftttME* 

x h^%y©ttan^-^RicM«i/Tv>«c2:ftmR 

t-T5IS*^7 4lcE«©->X-r/>.. 
[M«« 8 0] fl5EH«f 
^l©t^y F»©(Rg^ ; &y^-i?©(K3g^ ; 6y7FV' 
X*^2 ©If y F*©<-5?7— ^P-f >f*^Xdffl 

mas 2 ©if ? hstam 1 ©k?» h»«t o v>c t 

[H*3a8l] ttEKff^ftiiXK. 
WE^-^t— ^ft#tr)!rttlt®tta0^-^©4E-n 

•en©«9a7 F U-X *E<g1-«^SS«A5 c i$4$a 
fc-r*»*!C8 0 ICE«©->XxA. 
[IS#S8 2] t&E^-^x- 7*MR9ltt. IttE^- 
^-7>©<toS^-S?©!8!jS7 F P-X© i 



( 7 ) 
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[ffi#fi 8 3 ] mmt&fr v V 7-tM FA vVy 
>?7 h'UXt, m(f2g^$nfc<K®^ ; &'J7 Kl^xcoig 

ii * =& u ^- y 7 h v 7s t & tt«rr * tttt* a t . 

fflbWt'JTK UX^ftfclffiie L RUA*7 7 r ICS^? 
^TlEtttU. mmLRU^yyr^^r^Wct, 10 
t'J ^-^7 Kl'Xtf-HKLfcvv sute^- 

yf-yJKD^C^tU 7 F L-X^&S^ FUX 

[ft*B8 4] Mf2^-^7 L -^;i'tt. 1 

iH«©->XxA. 
[»#«85] M#m71S7, 1 57!>S2 3Rtf3 

77iS4 2cDv>-rn^iwrE«©-*ffi*nffr«3> 20 

*. 

[»«©ffWfcRiH] 
[0 0 0 1] 

BfcW©JR-r*tt*#IH *«W»43>Ka-^7-* 

H«*^-r-5fcJe>©n>lfa-^ • 7-+x^^Vt' 
Wr*t©T*«. *fc**WI4, 3>K;i-*'7- 

8©0*FKi«iLTV>*. 30 
[0 0 0 2] 

[$8fJi©«S] WMftHftO^AKC^lMaaff. P o 

s t s c r i p t (smmm mo^-maMm (p 

DL) Tf2M3n&Hlfcg?iK.fc9^»£*lTV>5. £ 

■5-b©T*s. ^©.td/i^-^faams^ffiffl-rsi 
c©j:5^ipj^*$ST#5ge-r$n«, ncEtttf 

>b#*ft*. 3 PDL-?©ttafls£J:t). 

>yft«**aicTr#« £ t fc^tf ens. 
[0003] ^-jnaasmiftiiDa*, *tt*»&*ffc 

•>a> (ras ter i za t i on) tUTftietlT 
tilSkMm^y^M-: >i >Attltt*X F C P 
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v?©li&£j£l;::fcl>T C P Ujgffii©#>fc 0 ©SB 

#£<£ffl-r -a©-?, £©fs], ^-f >cputt<B©s!ia& 

[0 0 0 4] ^77>f y£H«©£jjKfflg©;9l4©B 
T. A-K">i7«CJ:t)i«aft;*t>]t6-rci:*«»5n 

*afl3**fiW*fc»©, JPEGOEt/#S*fifft 

5 A- F £ x 7 ? y rff IS $ nr ^ -5 . 
[0 0 0 5] Bfc££j*T57U>$Hg;tt7^ 

5£^fflrtlC-e©^-^#:©Hj(H»a*^U. ^~v> 

[0006] ^^©ifcsrMjtt, «*©:/»;>*"*«& 

BiWTSllifc**. £©*§-B\ UASB«^ilAH« 

tr±^c-r5Jit*«a*uv>. ta^gaatBsft^-** 

ft t> r v>« £ £ tt no *> TOJiS;t4*^S^B*J 6 cji 
JEU cn*«a- tf©^*5l^fiC-r. SSKtt, -t 

[0 0 0 7] HK, nAStlAtUAOVttiZiSCT. * 

©»lfcriHftt4VO*)U7U'y>'3LSftT*»), ^c© 

3effc^ss«ix-^*^rartc^i(&3n&v^^c(4, * 
a^nav*. u^u ffi©bfx*©»»T»4. U7i^ 

^w*t>©-e*s&». n^T-^ic^^-sai^s/h 

[0 0 0 8] tUi*4*kiV»T, *%W©^Wf4, i 



( 8 ) 
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>y*)>if£ff13i?£t\Z&Z>. d£T<bU Ctlbx 

- * x h u - a o i -o tc » ifit> % mm \zmm-tz t , h 

Uftt>fc#)T&£ 0 

[0 0 0 9] g^, l:3C0fhSUHJ-A#iMftUTl> 

[ooii] #9lfcu7u±y<v-$:ffijLz>cpiJwRm 
teA-H'JxTiiTH n^n-feyii-iCPUitt 

ath c p uiw ^ u s»«i:»!iaT 5 a 20 

t'Jtt, CPU#a*fc7£*XT**J;5fc»#;Sft 

b^^T^^i^tc»ft^n§fci6ic, cpu^n 

T£&<&£> 0 mz % CPU**, 3^D-byU-3jt#»fP 

[0 0 13] **ft««©*Sttfcftl*.T, £©J:5fc 
v'XfA tti&^fe V>$^f y £ * *-U -C»a Lfctt 
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^tCPUIt^O BUSTS fc©T*§. 

[0 0 15] fig^yr^tfi-^-*^? 1 *** 

[0 0 16] n>bf^-^vX7 L A<7)l!;^^ c l:DXfc!" 

T#n«, «fcO-r«SLV>fce!>i&J5T*33. 
[0 0 17] 

>^-T£^i£T&oT, (1) *Xbynt^D-tl6ft 

«^7>f^Xffl3^Dt7*S*fflU (2) ffiC 

®5*-*fWXhy-A*SttU (3) mttteZtifb 
fflE*ft©5*-*»*;* h U-AO 1 ^^WlTIi 
x-^X HJ-Afc#LTH«fta*frfc3, 
[0 0 18] ff£U<tt, MBx-*W»XhU-A© 

< t fe i f«s»?i«a:fwrar:y h £jg& 

U (3) ©MaoiS*, MEWJfci^yhft 

3«&f5o Hfc, t&EX^yy (3) OTMftSttSM 

[0 0 19] *fc**W©tt©-««KJ:n«, aiasr 

fci&a&BT&oT, *xh^n-fe^*tiMw*jr9 

[0 0 2 0] **Ktt, MB^-^fWXHJ-A© 
^>&< t 1 a3WM»?F(BftlH|[ar:y h^WUTV^ 
So SfclQBifcJIIfteF-^WItXhU-AOiaEfcx- 

tO 0 2 1] «iB©»fflaSiWTtt, 
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©*ff & \z&m l rffiffl * * * «j sra * km s 

T-SWBWSTC&oT, (a) 1312*7. f- C P Ulr«t 

fflsns/tugiiSfiJo^T. (b) ^ans^ 

t. (c) SiilS^ZhCPUCDSiJffllWTtC, Jfijfc, Mia 

[0023] a 9 isxe-nfe^ty mfeoft 
fflE**yK«©«;tts#*«yiir*<^T»*©*«s 

£bK ifc, hCPU^S^T^U-fe/^U 
■TSt. n7 r niryiH:J;9^;**afc<fcr>fc.* : Ey 

©*tt*s»rrs. 

[0 0 2 4] *5WO»2©««fcJ:ft«. Bfife-tJ-fX 20 
©^t'J, *^ hCPUSCJ!'>&< tfc locn/nt 

tagBTfcoT, (a) ME**hCPUfc8MI3 

(b) ME**yXtffflB#*8±#&fc» 

as^n, 3yn-fe->*©^?!i*iifT$-&5©i'ffiffl5 
n 7°n t -y-©fefe ©* * y £ zmm csa o a t 
(o tata^^m **y 

[0 0 2 5] *aHr9¥ft©ftfPft&OT'«fl4©# 

'Ot^ >^+©#^-©*a.-©^« (1/3, 1/ 
2, Rtf2/3) dinya-t^U-lcfcO^ffStxs* 40 
T?» IWECPUCJ:«»a«t+Kajn«Ci:fcJ:Offto 
ft5©*<Mi;bV>. 

[0026] firF©»»fcwin?tt. tt&mmmvm 

©«fl*MRLfc**S. tt£H 1 713S7 t-t-OWMiW 
[0 0 2 7] *fc*3MBo— IMifcifttf. *7. hffiS 

wiBThxhiiti^eicfe^hfiBSpttys^ 
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y7h*l^£95fS*7s hMSgB©^^y©^Jifr-r5tl 

[0 0 2 8] #£b<te. WIEKS^tyf— ^;H4lW 
y © 2 -3^ti^n^±©^Fig^tC7 K l/XBItg/S 

[0 0 2 9] Hfl!S{EiB<»a^ ; &y-7 7fcf>ygfi 

BHfc. (E«*6*S7FU;*^©Vyfc;>y£*«T 
**ftx> h y 7 7tM H • Ay 7 7 

TV>*©*iS*bV>. Z.CD%&m)Vv?7V'( K • A'y 

7rlttci> h'J©rt, LRUSl;S^xx>hij 
£3S$-T5. ifciOiRgifea^^y V-y tr>^gg« 
HlC. WBE^ftffl;i-y f? 7iM K • A-y 7 7 © 1 
-tttBU©x>HJSH:tt, lof-p^M 

©*«M*LU. 
[0 0 3 0] *fc#fglS©Sg2 ©fi&tSCfcfttf. *X h 

**s~r s ^ ^ y 1 1 1> ^ ^ y ffiffi -r n 7"d -t 

fc 1 •QgS;{4^tlW±©iRi®^ ; ey 7 F kX£fjfE*7. h 

jaa^fi©^ * y ©Mjfc-r s«»37 p ? tr >y 

[0 0 3 1] 3&*ft9!©ft3©tt«tK£ill£. *Xh 

m * y ic#*ft*«k©«i>i * y ^-sxMrr 

8* * U £fi*.-5*X h 7"cHr y ftfc 3 7°n 

^ * y 7 h vt, tMfe-rzmmr h fr&ojjfje*© 

$ nfcfiSi / * 'J 7 H t UfS/X y 7 r 'J > ^SBfc 

J: 0 Stttei&tettfc* =E 'J 7 H V7, K >^ t £tfc« 
r*lfc|(*«fc. «9fE37°n-feyy-|rJ;iJM^nfc<S 

8^ =e u 7 h ttamny y r y >ygBicfe»an 

TV>5* ; &y7Fl'X-Ty tf>^©io©(K8^^y7 

h*pxt*t-a-rsi:Wfatkfe^B*t*ii)fe-r«i:> -a 

U ft * * »J 7 H kX v y tf > ^5 Strffi 3 7"D -tr v y\z 
fea7H^*ffi#fr-5gfi«h, «ME37*n"fey»J-fc± 

vmxzntc&m**: u 7 h v-x tuna/*? 7 r y >y 
gBicsft»tt$nTv^ t y 7 h* v y tr >^© 
vi-m^ofisi^t y 7 h wx t^H&bfev^tasit 

fefefi*^M-rst, MIB/ty7r'J>^fi*^— ^ 

x-7*;!/^ mbtii bit* ^ u 7 ^ bxv y bj^tflE 



( 10 ) 

17 

*^j7\tux\zttfctz>%iM7 Pisx&^fr, me^ 3. 

3. 

[0 0 3 2] Sfc*38«©SB4©««lC«tn«, *Xb 3. 

h^^ui^itisttftofis^^u^-^swrsfi 3. 

«^^U*«^S*x h^nir^+J-l-Sitt^nfc^yn 3. 

tl{£M*=ey7 FlsAttt&TZtoimT KUXS^UBr 3. 

ffi*©<ES*a^^ij7H^vyt>^t*jrt"r^^ io 3. 

u 7 k ux t we^y 7 r u >^Sfiica&»»snT 3 ., 

7^ Mf2nynir^^icJ:0^$nfc<Kg^ ;: EU7 3. 

Hi^xt we;i 77t'J> if mm \z&m £ tlT V > & * 3 . 

U 7 K 7 fcf 1 OCCHRSL*^ U 7 K l/X 3 . 

HR^^ttJElfctt^xy^Wte^n^t, Hftbfc a. 

**V7 \*VX^y\i>>fftZMUuzfu^yV\zmm b. 

7KUX£jgftT£Xxy:/<h, MEn^D-byiJ-fci 3. 
D^ant^BB^tUTHl/XtWEAy^rU^^ 20 a. 

fcg«E»irt * nx v>3 * * y 7 k i^x v y tf > y © t> . 

v^rn*o<Bi^ * u 7 h px t#HR ttwait 3 . 

<ftXf-v7T»£Sft*fc, WEAy7rU>y8K 3. 

^-vx-y;^^®0ffltrc^ : E 1 J7 Kl/XT^tT> 3. 

^TE^fr^KSffXxy^tSWb, jfflEttOfflbfc-* 3. 

: e'J7Kl/Xv^tf>^|9Enynir^^lcJ:Dg* 3. 

zntcfcm** u 7 h i/*£#j6-r*tta7 h i/xt^- 3 . 

»E^-^x-^38^ft©*aW^-i?KEf|jE 3. 

*nTV^. 3. 
[0 0 3 3] *fc*»Bfil)JB5 0««tJ:ntf, 30 3. 

^ystm^fA^oT, *^h^n-fey*« 3. 

mt, *E*^hynty*e«fc^aniwB«i^ a. 

tuftjBSTs^M^yt* fiM^^uftttffi-r* b. 

3^n-fey*i, IME3^n-fey*fcW*ati» <K«!* 3. 

*V?'-7Jl'**M\sXm&37n*yV\z.frBm&2 3. 

ftfc 1 ^sfctt*:ft«±<Z)<Eg* * U 7 H VTs £tJI2*X 3 . 

h/*ucD#*T**a7Hi/xcvy tr>yr*«a 3. 

•■7HkXV^K>^K«fcft*f*. 3. 

[0 0 3 4] aTOftMfeilWTIl, a#«KJfflS<D<l& 3. 
©«#&#fiRL&**6, ftKBl 5 0 7iSl 5 4£*© 40 3. 

■»RflfcflEBanfcV>. 3. 

[0 0 3 5] x/zl 

mm®&Mv>mm) a. 

rg&j b. 

1. 0 BBOflMlfeKH c. 

2. 0 x-^MJXh d. 

3. 0 &m&RZfi&<DkkM 
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4 tg*©X HJ-A07*-V*/ h 

5 7x h U 

7 ^(D^^-FRisnft 

9 U5?x*7£i:xtv7*©*i*» 

1 0 ^3>hP-7 

i i p-*m^v?x?7 7^f;^^a-MKW 

1 2 l/^X^©U-H ■ 5< hfflS 

1 3 ^Uxy 7©U-bV^-1' hMS 

1 4 C/Uil 

l 5 ri7 p nir-/^©x-^^-1'7 r <hx-^iS^ 

l 6 ^-^jBBfcfi* 

1 7 7^ir5 1/-^*- KcDHfiijD.a 

17. 1 -&f& 

17.2 eamsA** 
*-a»a#5-£iB (sogcs) £*e-k 

17.3 J PEG«F*fc/fc*|fl; 

1 7. 4 
1 7. 
1 7. 
1 7. 
1 7. 
1 7. 
1 7. 
1 7. 
1 7. 
1 7. 



iU5£DCTgg 

(A-7h»» 



D- 
1 7. 
1 8 
1 8. 
1 8. 
1 8. 
1 8. 
1 8. 
1 8. 



5 
6 
7 
8 
9 

1 0 
1 1 
1 2 
1 3 

■#;pdma#* 

1 4 7n-fi 
7£ir^l/-^#-Kco^>^-;P 
l 

2 MUVAy7r 

3 »**-jJ^-f1f 

4 *'<5>F*«^-f1fB > C 

5 ^Of-W^ar^ h 

6 f-^*t^r>a3>hD-7t*t7 



J PEGff%^£-H 




( 11 ) 
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3. 18. 7 
3.18.8 
3.18.9 
3. 18. 10 
3. 1 8 

t— 7>5 
T— 7^7 
7^-^1/9 

t— o 
7^-7 r ;n i 
t— 7;n 2 

T^-^H 2 A : 
T— :/;H 3 : 



1 1 JiiZH >^-7 X-73/hD-7 



*^7>KIW1 10 
x-^&f£l^x^7*-vy h 
SOGCSt-KfflTK 1/X£t8 



t— i 6 
r— ^ i 7 

x- :/;n 8 

7/n 9 

T* — ~f)V 2 0 
1 

f—7)V2 2 
T~7)V2 3 
4 

T*- ^2 5 
T* — ~f)V 2 6 
T— 7^2 7 



7x7f7 KUX 

A77>^r/l«f-^l/ffl/t>^ 

ft<fr7- K- v-f ^-^3- K7^ 
<M*7- H- V-T^-^n- K7^ 30 

f«M*7-F 

♦^*^5>H-tt*7-H 
♦♦7-H 

♦^*^5>H-lt*7-H 



<M*7- H-v-r±-^^n- H 7 



40 



So A-H*i73^ 



[0 0 3 6] tfaa^MWTfS, 2^7h'J-A^ffl 
v>*«fi£t>pJfilT*0, A-F7x7M'-F:*7*«8 

[0 0 3 7] 2^h'J-AMW^(h^ 
li^37 o n-fe^ii-0 f )A»K^x7^t^ HirtKKKJS 

3. 1 ^XhU-A7~*x^VC9-/&$/£ 

M2 0 1 b&HT**. *rt2 o l k: 

> r, J7> F 2 0 4*^bt*XHEl^ ; E>J 2 0 3\Z 
SttSttfctfX FCPU2 0 2^6rtS8*P*^ h3> 

vXxAiCfcL t^I/-f>f>^fA7 p P^7A, 

-^^fAliPC I/U^f>^7x-X2 0 7^^L 
TgS*PPC I AX 2 0 6fcftl«T£tlTV*5. &43* PC 

^)n>bf^-^ v7t^A, Wtcv-f ^nV7b»?>f >H 
7X («PK) t4 >y^fA*»tltv^ 

yXfAl:^, PCi;U2 0 6AififooWS. PC 

1/1*2 0 6*fflV>££ifc«fctK PCI/W>^7 
1-72 10, «Ot/H^2 11, D-*JM : EU2 

1 2&i:*!gfc*ttl^*tt*ft©PC I#-F (Mx. 
« 2 0 9) £«/£2 0 1 fc#ttLTfflJB**£i*^ 

[0038] *Kia*i(«Trti, ^-$?eiwbt*s 
stifc^57-f y^7ftatwafcr«fc»K:» 7X^ 

■•7^'fe5W-^*-H2 2 0 
7^*91/-^*- H (PC I AX-f >^7x~X2 2 
l£ffi;L£) tt, ttCPC I*-F2 0 9fc£iR|«fc 
**hCPU2 0 2itt, a^^fctt^Snfc**^* 

T*n«, ■*7^-t5l/-^*-K2 2 0*SK*X 

(a) 

(b) -ttftfiara** 

(c) JPEGff%/« 

(d) 7>k>y*. TMWfc/tt* 

(e) NMAOTft (ftfe) ft* 




21 

( i ) A- 7 h->M8 

(j) *U 3 bMfW 

«n^n-fe7*2 2 4II^^n/:P-AJM ; t'J 2 2 
3S:fix., ^^Hfla^oiryl^ 2 4tt*7hCP 
U2 0 2^?»©ft^^ScfV^T7X^H<ft7i7-«r7U- 
*#-K 2 2 0 &fi»-f£. nyn-tr^ij-2 2 

4tt^Ml*IltLSI (ASIC) TW^i^ 
LK, 77*H*:j ynir y-y-2 2 4tt, 
W<tt)l^oy , J>?-f/WX2 2 6 £J^-T> 

\Z. mm7i7±yU-^tf-\ i 2 2 0*4, 

X2 2 7#«*.S;JrC43 0* *~*U>^xXh*fT 

[0 0 3 9] *frE-KT?»4, *7 KCPU2 0 2**P 
C I A7 2 0 6 fc^LTH^rfM^x-^SaiWU 
^7^H^37 p nir-/-9-2 2 4TfH«o*)*ffla*fT 
5. S8Stlftf-^HD-*JWtU 2 2 3 0#&& 

^7^®^n7 p n-ir^i^2 2 4+©**y:^x2 3 
0, $»^V^in7 P nir^-9-2 2 4 4x^1/^7^ 2 2 9tC 

[0 0 4 0] B2tt, 7X^i*a^n-b^2 2 4* 

2 2 3fcjim-r*fcJe>©n-*;M ; £U«l««B2 3 6£ 
IflTUS, «iB>f >^7x-7fel«fiB2 3 714, 7 

-^7-f >^7x-7gtfP7*- h^te^b*x^-f 
>^7x-77*-7*; h&£©K?7*-7y hftflj 
UiB-f >^7x-X«»P«2 3 7Htn-*^ 
%UMWB2 3 6 trt«J»anTV»*. D-#;M* 
UMMf2 3 6 £»»f >^7x-XBI8IS2 3 8 ttt 
Mrt>*y3L-Wfy9'2 5 2*^LT««S*IT 
A*>f >^7x-XX< 2 5 2 H^ISffi 
2 3 5 £ft$3ftTV>&. XW>*7x-'XX-fyf 
2 5 2tt*fcK£'feJW— tf^-ftf 2 4 6tf-m 
y^a«»i«2 4 0fc»il!*tlTV^. AW>^7x 
-77-f y?2 5 2*4* *f-f>^7x-XBW2 3 
7 tD-*;M ; EU»W«2 3 6* 6 ©5*-****? 

SPSB2 4 0, t!^-fe^*-Jf^>f-!f2 4 6fcCar*fc 

tmij H^^^x^i«i2 3 an 'wn 
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«B2 3 9lc»IK*n, n^Dt7^2 2 4 tttBLT» 
fPTSffi*^-;^ 3 9»oTl^ ( 

[0 0 4 2] v ya2 3 0 tt, fttt^tlT^ 

Sx-^*^y->i*J»K2 4 0(O$fflTTiW-f£o 
X-*** y 2 3 0tt«*<Z)ffl&t;:i5^TfflV>£ft 

5**, 37^0^^-9-2 2 4{z&^Ti\$m$@>m2nz> 

10 6*16. ±jfi©K*ft;fflatt, ±iLTJ PEGflMHb 
/tg^&2 4 l^^^>X"^/^7gl$2 4 2lC e J;oT*S 
»cpx-^7 h U - AOflia**ff *>n§. 2 4 1 , 
2 4 2 1tjtm\Z tf tr-kfrX-jft-flf 2 4 6 <h 2 
^7>H*-^-fif2 4 7, 2 4 8(C^S$nTV^ 
«ft2 4 1, 2 4 2*>S©ffl33ttfcX HJ-A 
fe**-^^^ 2 4 9Kfi2IS*U i&ST?»tl« 
il^S7t-77 hfflS**frfeft*. ^^iSS 

2 3 0 KJOX-T, t?^1r;P*-^-f1f2 4 6 i:IS** 
20 -^<1f2 4 9^^WiCV;U5 1 a-7hfll (MUV) 
A'7 7r 2 5 0£«£TV>5. &&;*-#:Mlf 2 4 9 

2 3 8, n-*;M^UMW«2 3 6, MO?7x 
-7$HfflgB2 3 7 Kffl^ftSo 
[0 0 4 3] H2*©/ai»TS3tlTV^J:5tC, £ £ 
(31 3 CO) 7-^US2 4 3*, JPEG^ft 
/% J %&2 4 1 <h*<>^-^A7gB2 4 2 <hUofcffi 

<oa7 o**«-Wffcftff-r* fcorafe^ z. * Kaicsti 

[0 0 4 4] B2©AS I C©^RM4HT©«fc'5& 

/hSfc, *l4H»Wa:«»«fl:ftfe4i;s*35cV^t3j« 
«MI#"tt4, £© e fc5fc/h2fcHft#{fc 

§o sic, yu>^ts**Tfcas^i;«t, ^~ 

^**7 P U > * rt*IM» LT v>* Wis: a v>*9M0flttbP 

»fi**oKaK:ii**ai«-r*i 

t#&RtaD, y7h-?X7*fflV^7^D-fJ;t) 
t>A-H»>x7<0ii!iSfefc«§7yD-^©^« : *b 

K ■ ■ - , ■ 



( 
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0, ##K*«ftfc©£ftoTbS'5. £tz. A-F* 
x7©IMP*t!-Ktt, 

[0 0 4 6] CttldttbT, ^AS I CcDx-tf-f > 

BfcA-K£x7<0«*<0«ffiA* (a) BJIbT&O, 
(b) H^UffSnsciiiti&vv <hv^»<'<** : 

[0 0 4 7] Zl©«t5&|i^5, vo<i^fy/ 
£&T, A-F£x7©T^T©«tt*T#££tt7£ 

f-f^i:l/3!i#5, A-F*x7**ft«T*c:ifcb 

B^T«&T#5K2ffl<ft5fc«>, 
[0 0 4 8] *ft©x-*»3/tXSKW-3i, 

Tftfr^ *^>x-*/«*2 4 2#J;!Jiaffl»ftffl 
4l©J;5ftJPEG«F*fc/fl». **vwaflrtHfc 

2 4 3 *»*«»&fcttX> b D K-ffr*flS*A7 V>«F 

[0049] si^ »ifta»Tv>*mfc, Aava^ 
fciiMfl3ftB*£i^**i£'bfc* a-h^x7S 

[0 0 5 0] eiciTffiBttcttt, n:/a-feyU-2 2 4 
***XhCPU2 0 2 (HI) olW»©TT?*frS*l£ 

tt, CBus (C/U) tmitl&ttWtA2 3 1 ICJ: 
oTnynir^*2 2 4£i*f£$i*-£. CBu s 2 3 1 
\ii*Hieti<D : Ei?3L-A>tb&'ky \* (tt i © 2 

.31) fe*trt=*d-M S tt - 2 6 .ti l£MNl*Htett 
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l^e^n-?nc0^va-;U2 3 6 - 2 5 0 *TCC»»iBt 

[0 0 5 1] B3tt, Mm^imt^J^-)VU^7s9<D 
8i«ftK7>5h2 6 O^Ucim K7^ 
h2 6 0te, 37 P D-ir^it2 2 4 

2 6 1 £$t^fm&2 3 5 t^in^o 
^LV^^Jzl-)V2 3 6 - 2 6 OlCte, f^CD^X^ 
2 6 2^^tl^o 

3. 2 ^h/a^Dtylf'+a-OJ' 
10 ±&©7-*7 L £9 L *lCJ;tUf, *X 0 
2 tHfcn 7u-fey1i*2 0 4 tora*r^»is^+»tci: 
t»nTVi§21t36^ST*SCl(!:**to^5. b^bftrt* 

*x^*ttW0fc©TI4ftV>&», HTtf^D-K 
ttfcfWIEA- F*x7B«S»fcbTBt«-r*. 
[0 0 5 2] Sft(Z)a>ea-^yXrAH SjM;*^ 
U 9tm o fz « left* b & © * * y *l*Sft« 

[0 0 5 3] Htt»ftn>h:A-*A-F*x7*MET 

tt, CPUi1*OT©3:/n*?tf££«;iL -c-n-tn** 

SiilcCPU^^USSHOST&Kr^T, n^nir 
y ttt SlE^ ^ »J WIfcft 0 C P ui: TDfftS 

30 «*«ai:sn«2:fc«fflET*fc»fc, cpu^nyair 

fett«©BTr»*bv>i«*i«v>. 
[0054] frWfcWDSTsnfc^^yftwv^ntf, 

«»«K»6;s**2:t«cf^rBta:*. n«fc, 3 7 p 
n*yiJ-3Wffla©*ff*IIA«*TCPU*^py^b 

40 *6©«a©ll7*»6**-f>^5^hfll*©ffl«fc 

[0 0 5 5] »ttflBBft©ffifc, :^5ftyXfAT 
tt»W6:^^U*2fc»bTb&^K#ftb&Wn« 
ft 6ft K ^©D^a-^XxATJSa*©;*^ 
■y«MXfc*#ia«fcftoTVW&#, 3K©**Uft& 

:| lXi^<D^xATtl ^ftt^%Uft^6%>+M: 
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[0 0 5 6] cn^vmmzm&t&fc&iz, ^ta 

[0 0 5 7 ] H4(C, (*XH CPUtn^D-feyiJ- 

[-0 0 5 8 ] H4IItW, CPU2 0 2^>7fA^ 
OT^TO^^EU'ffffiSttiebTV^. CPU2 0 2 
ifi, §H\ *^N^ynt7^2 2 4T<0jflJfl§<Z>fc#> 
^'J 2 0 3 &*J9MST£. 3/Dt^D"2 2 4li 

0 2 £*WbTV>5j**y 2 0 3^£ft* 

1 0 2 2 SHffrSCt**T#«>. £ne>oft*<z>^ti 

-f*ltt««l 0 2 4^M^ : eU 2 0 3 

ifirt, *fc^^U 2 0 3*5*^5>K*tt*atrz: 

2 3^«*1 0 2 4£fStrf£lCgT£^ :: EU 2 0 3 0 

sa, »a©*»is*aMfcft#?-*. 

[0 0 5 9] CPU2 0 2H 3^Dty*2 2 4 fc«t 

t>T*ff*n«**i o 2 2*triE-r«ffla*t)ff'5. 

CPU2 0 2 i:ayDt^2 2 4 k<DWl0tm& £H 
*\ZTZ>rz&\Z, CPU 2 0 2l;:J:^T£/&£ftfcft* 

tti o 2 2(cs3n«i:5it*i~ f >ysnx*^n 

^n-byU-2 2 4\Z&\,*Tmfr2tl*. *;x- 1 0 2 2 
+©#<MH4, n^n-b^U-2 2 4©fcJ6lc*^hCP 
U2 0 21^-3 TWO STSft&JW^U 2 0 3^ 
**7>\* 1 0 2 3^*1 0 2 4*#»?"*E£*iT 
**. 

[0 0 6 0] H5fc*"TJ;3fc, Ztlt><D9&m*fT5tc 
&\Z, ft*£*£g&l 0 3 0, **UW8«1 0 3 1,* 
a.HPa*l 0 3 2jW**BS*ITI>*. dtlS-T^TO 
^a-^tt** h C P U 2 0 2 ±T¥-^D^X £ b 

[0 0 6 1] nynir^-y-2 2 4fc*tt**fr**tt* 

*£j&«bi o 3 o te^v^Ttrt^ti, ^uwaai o 

3 lo-u— tr^**j«bTflfej*sn&**©*^9>H 

1 0 2 3^»*1 0 2 4©fc»©««fta!3ST*. £ 

fc, ft*£$SB 1 0 3 0H *cx-«3ffl5 1 0 3 2 CDIf 
-bf**«JBbT, 3^Dt*;*2 2 4Tf»Tt«»* 

[0 0 6 2] #ft*«t3^D-feyy'2 2 4K*^T*ff 

£tt5£, cpu2 o 2iw^u*a«i o 3 urio 
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*<jE»sftsWfjan> ^tysaan o 3 1 

amO3 2fc0Wt&Kft#bT43D, ?fJfflpJ^vX 

[0 0 6 3] H6tt, n^D-fc^U-ft^+a-l 022 
<0*J*£«£«lCiSbfcBT*£. A*Ptt*7hCP 
U2 0 2lCJ:D^>x^ >^ftfr*i- 1 0 4 0 KJf A 

io sn, 3yp-fe7*2 2 4(c«tt?T»*fflsniifTtw» 

2n& 0 n^n-fe^-y-2 2 4K*tt£HffJ&a**»7T 
ft*tt^U->7^y*i-l 0 4 lfcfiSId 

3/nt7f2 2 4*jffla*»A&aTft»***i&» 

[0 0 6 4] ft**^-l 0 2 2S#ttH£»5W4» 

w^BC-y-fXoaEi/x^^rtbTmsn*. ft** 

^-1 0 2 2lt CPU2 0 2l:i;mo^tn7 P 
u-feyii-2 2 4K*^***0*frtft»libTVi-&. 
[0 0 6 5] #ft*<0:*^7>H ft* 
20 £j£f$(Cft*£/£gn 0 3 OjfcSflDB^KfcbT^U 
«a«l 0 3 1 (05) fcioTMOSTSttS. «fb 

S^^UMSl 0 3 1 t+^-fg^ 1 0 3 2 

\Z yXf J; 5 K bTV^S. 

[0 0 6 6]H*a-f!«10 2H n^D-feyiJ- 
2 2 4 1 0 3 0fc«koT*/*3ftfclfH*4 

nfrbi**.**^ wai-rscii^T**. b^u * 
^yffnti o 3 lic^oTSiDST^n-sft**^- 

30 1 0 2 2t^V 2 0 3 Wtl«, 3^nty 
*2 2 4*^<«rO*H*ta:Vi*\ **W44>fc< ifc 

So cciwrc, *i-HRMio3 2fc**y*a« 
io3i &cDm-cttfmAA^Rttdn«. 

[0 0 6 7] ♦**a-«»10-3 2CtoT, «S7 

bfcft** r?ij->7^j u ■wsfcaosT&n 

fTlbftVio ^UWMl 0 3 ljOTJAWtefc;** 

Wb&*#fctt, *aHfa«io3 2K:^y->7y 

^Sfc^U 3^U-fe^U-2 2 4«CfcoTfctt*ffl 

«Ufl«l0 8 1H, CPU2 0 2^/D 
•fes^S ^4&&d, **Ufetdytt-fey^2 2 4fcH 
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[0 0 6 8] *tU»a«l 0 3 

^a-t^SE 1 0 3 2 lC^>xV >y^*a- 1 0 4 

o+offla't 3 **©-*. WA«*»***7r 

^©VK^id«7t5iTCPU2 0 2II«^n 
•y^^nscitlc^-S. 37'n-lr7-9-2 2 4tfH*©v»< 

ti. R#*»t&rK+#fc* : &y**»'&n*. fflg+© 
©ahs-b^x w >y#^-*i- 1040 tt#ffibTfc 

5. *<©»£\ CPU2 0 2*«#itM>r-f>^ 
0 4 0 4 3 ©-S8^^ , ;->T-yy-r-5eii: 

icj;o, ^tutaffli o 3 i fct^T+tffrjjt^ya* 

*M*Sft. 1 0 3 0©B*£*lfc-tz:£j5ifC 

[0 0 6 9] 3^DtyiJ-2 2 4*^>-r-f >#frft0 20 
«Atf¥5W<*frl*7f * *TT^I bfc t bT 

-XC^Ctt, ^^E'JWagCl 0 3 1 «T^T©<> 
yn-tr y U-rfMfcMRT-r* * Tfir*?- * «t ^ 
?»&©¥&£ £3. yXfAOSfiffl^t'JtMS 

T. JinicJ;t)#^*ffil 0 3 0©M**«fc-T»c+ 

[0070] ^©i^apttutaai o 3 1 t+a.- 

«a«l 0 3 2 iK0tSW»fP»r.«fc»3, v^fAlcM^ 30 

*i&j« ; EUfi2 o 3©#Tr3&*«fc:x;i'--:/y h*fi^c 

SitAiTftS. J: O^fttijt^y 

a^n-fey-y-2 2 4 #g.bV> 2 0 3SfoT<D« 

* y < t tiwiw* * ©©tttBtiftfc-r*. 

[0 0 7 1] 0 3 0rive,©g££«5t;fc-riR 

fc^UWaWl 0 3 l«tfr5«L9:*7-?7eGl~Ffc* 

*u«a«i o 3 lOTwttdfc-rc+^a^^u 2 0 
3*msn*a»£9:M»'**. +#fc> ; ey*<#sn.& 

1. flfflUJHefc^y 2 0 3Tg#£$fcf ditfli* 

-5 

! "2i' *HT©fefb&H^ U->7-i7t& 




A&«tt# <«*.«. 1/3+2/a) ewmtst 

i>\ £S©;<*U&teffl1"*£ia<fe;froTV>5«r5g0 
**S1WI1-*&£. #!©:*:/-> 3 >£fflV>3iL£fcT 

[0 0 7 2] H7lC::fcV>T, ^tyfill 0 3 1 

a-*a« 1032 £©nmr©tsnftf£fc:ft]*.T. @£ 

fift^a-A»;7 7 1 0 5 0*i»nfc*^»C«*a- 
taSBl 0 3 2*i37'nt'^2 2 4 <t|SlW«:i:5ii: 
£©£3&1*fR£0 7 lC*bT:fetK 'Ox 
-f >^#^*i- 1 0 4 0 1 0 fl©fir<fr©*a- 

irtsns. ^ic3 7 p D-fe7-y-2 2 4CA**n*#^tt 

[0 0 7 3] «*Wttftfc»#Ctt. 0 

3 2(4a7 P n-tr-yy-2 2 4^>x-f >>ffirti<Dto%tf. 

¥#©«a«i**.*3:-cfWM-*. h©**«cj:d. a 

#tt*i-ffaffl 1 0 3 2 ICioTif A$n-5£rLVMfr 

[0074] ffbv^^r&x^r^i-y >y-r-5is©* 
i-faaji o 3 2©i&fm£jrF©«9T&3. 

1 . ft<fr*jL- 1 0 4 0 C+#fc««;WgoTl*5i6vr 

2. ^TVi&V^-g-ra. 3 7"at7t* ! 

3. &r b t^f5M3-£* i - KJ? AT -5 

S«l 0 3 2©lWm«T©jIt)T*S. 

1. #^*<lK7bfctnya-t:->-y-2 2 4^e.«*^n 

2. ^U->7y7 r SnTV^V»*7bfc#^*»*-6« 
£fctt, *l^7Lfc**£*a-*5liffir*-« 

*f L-H***£jW*R©**£jMB 1 0 3 0 ©aim 
OT©ii0T$.S. 

1. #^^v>hi o 2 3c<3Rt* i ey***yw 

9V 1 0 3 1 K»*T* 

2. «8f*dlr«ft**r* 

3. 3^n-fey9Hr*ft*a-»a»l 0 3 2 Cffi&L 
SU©»flr/n**fc«tt:i-H©jrejRUfc*tefirF 

[0075] /tuff a 

ALLOCATE MEMORY 

BEGIN 

i f s#£ffifcT©fc+#&**yrt<#e>n&ir>i:-f 
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30 



THEN WA I T_FOR_I NSTRUCTION 
END I F 

if m^mtzt<D\z+ftt£*^)mmmt>nt^ 

THEN X7-££B^LM£ 

end if rnvrnxtzttyimt 

*aHfJI 10 

SCHEDULE_I NSTRUCTION 

BEGIN 

if &ir*^-\z+wmmmt>nte^£-t%t 

THEN »*Bf3t*CO»**3^D-fe?;-y"3j«*7"r« 

END I F + 
END 

WA I T_FOR_I NSTRUCT I ON (i) 
BEG I N 

♦❖i^Tbfci3yp-ty^6«3ftStl**-C«f 20 
WHILE *7LTV>Sfc<D©^U->7^;/SttT 
DO 

IF *©*7bfc**k:^U->7yy«IIB3&«*to^ 



THEN !7V->7y7mm$:VmftT 
END I F *a-*^*57bfc*^*8'J»f * 
DONE 
END 

GENERATE_I NSTRUCT IONS 
BEGIN 

ALLOCATE^MEMORYWMl, fft^r^ 

SCHEDULE^ INSTRUCT I ON*fftfB 
END 

3. 3 n^nty*©i/yx^©RS 

[0 0 7 6] i2t^>?a-;H:MlT, 

[0 0 7 7] l^**0R9I 
[0 0 7 8] 

[0 0 7 9] 



m 1: 



eic_cfg 



eic stat 



eic err int 



eic err int en 



eic test 



eic_ J gen_pob 



eic_high_addr 



eic wtlb v 



eicjvtibj 



eic mmu v 



dcjnmu_v 



Config2 



Status 



Interrupt 



Config2 



Config2 



Config2 



Configl 



Control2 



ConOg2 



Status 



Status 



-fiT ft y h 



TLB Invalidate/Write ffl^T f* 



TLB Write JB0tt97 HUXtfMPtfy 



&&£ftfc Most Recent MMU iSSir ft 



MMU T7xy^£ftfc Most Recent 
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1 1 
til 




19 
61 






Recent IBus J£^S7 K 


eic_rp_addr 


Status 


PCIBus ICT^-feXf ZtctiXD Most 
Recent RBus ffl WI7 KU-X 


eic_ig_addr 


Status 


ittfttt Bus {^7i7 -feX-f Sit J6<£> Most 
Recent IBus m7 b VTs 


eicrgjJata 


Status 


fcEwtfy Bus ^ / y 7ZA 9 o/coDC/j Most 




Ays ~f 


mm 








Imicfg 


Control 




Imi sts 


Status 




Imi err int 


Interrupt 




lmi_err_int_en 


ContooLZ 




lmidcfg 


Control2 


DRAM □>7>f7l/-y3>I/y^5' 


Imi mode 


Control2 


SDRAM HI/ S^X* 



[0 0 8 0] 







pic_cfg 


Confifi2 




pic_stat 


Status 




picjHr_int 


Interrupt 




pic_err_int_en 


Config2 




pic abus cfR 


ControE 


abus ffln sa—^a >t$m 


p{c_abus_addr 


CoafiKl 


ABus transfer K I/* 


pic_cent_cfg 


Omtml2 


Centronics E n > 7 W ^ U - 3 > £ ffl 
® 


pic_oent_dir 




Config2 


Centronics UP h ■ 3>r*D— 


pic_reverse_cfg 


Cbfitrol2 




pic timciO 


Configl 




pic timerl 


Configl 





f 
f 
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§291 




mm_cfg 


Config2 




mm stat 


Status 




mm err int 


Interrupt 




mm err int en 


Config2 




mm^gefg 


Config2 




mm_diag 


Confifi 




mm_grst 


Config 




mm^gerr 


Con£ig2 




mm^gexp 


Config2 




mm_gint 


Config2 




mm active 


Status 


Cfu-;vV7?y-<< 








mm 




ic_cfg 


Config2 




ic_stat 


Status/ 
Interrupt 




ic_err_int 


Interrupt 




ic err int en 


Config2 




icjpa 


Controll 




ic tda 


Configl 


^HU-ATodo WJX* 


ic fna 


Controll 




ic inta 


Configl 




icjoa 


St&US 


Last Overlapped Instruction Sequence (M. 


ic_ipb 


Controll 




ic tdb 


Configl 


B XMJ-A Todo V*JX9 


fcjhb 


Controll 
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36 


ic intb 


Cdnfigl 




icjob 

_ 


Status 


Last Overlapped Instruction Sequence (II 


ic sema 


Status 


A X HJ— A-feV?* 


ic semb 


Status 
















dec del 

******* O 


Configz 


dcc ^ y J a y v y3/l l/z/ys ^ 


dec stat 


Status 


ttHggXx-*Xfc> h 


dec p.nr int 

**Vv 1*1 J 1111 


Status 


DCC :7 X l/VA? 


dec err int en 


Contrail 


DCC X5HW?i*-f*-^;Hfy h 




Control2 


DCC /a/2 l/y/y 


dcc_addr 


Configl 


#giJ7 H UX*- Kffi^-X7 H 


dec IvO 


Contrail 


V-i> 0~31 ffl ©"valid" t* y h Xf- 
*X 


dccjyl 


Contrail 


7'f>32~63/f!<0" valid" h*7 hXf- 
*X 


dcc_lv2 


Contrail 


7-f > 64~95 fflCD-valid" bf V V Xf- 
3>X 


dec lv3 


Controll 


7-f> 96~127 ffl ©"valid" k'*y hXf 

-*x 


dcc_raddrb 


Status 


> F^-tf-f *-f If B H ^ 

X 


dcc_raddrc 


Status 


> K*-^ tf-flf c H V 

X 



0 8 1] 





94? 


mm 


dec test 


Gontcoll 


DOC xXhl^X* 




po_cfe 


Cbnfig2 




po_stat 


Status 
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— 


38 


poerrint 


Interrupt 




po err int en 


Config2 




po_dmr 


Config2 


tea it i _ « » 


posubst 


Config2 




po_cdp 


Status 






Contrail 




po_said 


Contrail 




pojdr 


Control2 




po_muv_valid 


Control2 


muv W3!it*y N 


po_muv 


Configl 


MUVRAM C^-XT K 1/7. 



[0 0 8 2] 







mm 




oob cfg 


Q>nfig2 




OOb stat 


Status 




oob err int 


Interrupt 




oob eir int en 


Cbnfig2 




oob dmr 


Conrlg2 




oob subst 


Config2 




OOb Cdp 


Status 




oob Jen 


Controll 




oob said 


Controll 




oob tile 


Controll 






ooc cfg 


Cbnfig2 




ooc stat 


Status 




ooc err int 


Interrupt 




ooc err fat en 


Config2 


x5-ygy&*-f*-:/;i/i^x* 


ooc dmr 


Cbnfig2 




ooc subst 


Cbrrfi£2 




ooc_cdp 


Status 




OQC_ko 


Controll 




OOCJl&td 


Controll 




ooc_tile 


Controll 
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40 








JPEG ft 




jc cfp 


Config2 




jc_stat 


Status 




jc_crr_int 


Interrupt 




jcerrinten 


Config2 




jc_rsi 


Configl 


U Ay— rfwffl 


jc_decode 


Control2 




jc__res 


Contrail 




jc_table_sel 


Control2 





[0 0 8 3] 





mdp cfg 


Config2 




mdp stat 


Status 




mdp err int 


Interrupt 




mdp err int cn 






mdp_tcat 


Cbnfig2 




mdp_©pl 


Control2 




mdp op2 


Control 


988f|S2 


mdp por 


Control 1 




mdp_bi 


Contrail 


>K 


mdp__bm 


Contrail 


» 


mdpjen 


Contrail 














ro cfjR 


Gjnfig2 




ro_stat 


Status 




ro_crrjnt 


Intenupt 
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42 



B rn prr int p»n 






IE ro dmr 


Cnnfw? 


^ wfcT r 1^ *s -A ^ 


II IV/ JUL/i)L 


f""Y\nfi o 1 




ro cdp 


Status 




ro len 


Status 




ro sa 


Configl 




ro idr 


Configl 




ro vbase 


Configl 




ro cut 


Configl 




to Imt 


Configl 










A read only copy of PCI ^>7-4{fU— 
> 3 * 0x0 * QxD t OxF 


pci_extemal_cfg 


Status 


U *M^> >J T;l/ ROM ifl 6 ^ 
0 >D- H Lfc 32 If y h 7 He 



[0 0 8 4] 







IRK 




iis cfg 


Canfig2 




lis stat 


Status 


Xt— *XUvX* 


ifc_etr_jnt 


Interrupt 




iis err int en 






lis ic addr 


Status 


ICA^&©X*TKUX 


is doc addr 


Status 


DOC*eOAA7HUX 


iis_po addr 


Status 


FOAt&OAftrKl/X 


us burst 


Status 


PO, DCC &\CiP*> (0/ t— X h 


tis_base_addi 


Configl 


#x h^t'jv^yicwsnyaty 






»/*U*^x{r KO^-XTHU-X 


iis test 


CODfiftl 


^Xh^-H^X^ 



[0085] rn&(o^^x^+T»g-r^fe^«^ 

(a) tfi^tf-f >*k>>X* (i c_i pa£ i c_i 
pb) . Cft6©l/^X^^7»4Sft*frbTV^A^ 
0ft87Hl/X*»Jfrr£. iK«7Hl/X©#BfcA^ 

Xfc»5««-fctt5?^>^*^**fflV^tl-5, 
32tf^ h<9:>-^>X#^jWt#£tt> 

n^a^^^2 2 4i*x^ct>tJ2 0 fegbErlz&k* 



(c) ToDok^^ (ic_tdatic_td 
40 b) o £tt£C!> 1^X^7*4, f >^StlTV> 

^^©v->>->x#^*^-r-5o 

(d) -f >^^^hWi?X^ (i c_i ntatic_ 
intb). £*lS0W>X^7tt, <i>*77h£ 
*»*->--!r>X#^t*Mft-r*. 

(e) >f>^7/httil/^ (i c_s t a t. a 
p r imedt i c stat. b p rime 
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43 

(f) Uyx^7*tXt77*7 (ic_sema^ 
ic_semb) . *XhCPU2 0 2lt 3/Dt7 
iJ-2 2 4^©i«S£tt, li£l^±COlxvX^^coS 

ittt&i&Btb&V^^^7^tX©*^4MB#Tfc 
HfTT^^t^T^^o *7hCPU2 0 2^tV7t 

t^TT ya-fey ■y-coHfrA^Bff-rs CtT& 

l^X^7^-feXirv:7:*7te, n7 p Dir^it2 2 
4©**/ttMl^i?^^0 1 fcfy htbXttfcSttS. 

3. 4 SSXhU"A7t-V7h 

5. l^O*^ HJ-AtttUAx/W X#*ji 

^gcotS^X h U -A#ffi<2ifr^X h U-Atfttik+X 
**fc*fc3:/n-feylJ-2 2 4© ; E^jL-;p*flJffif 
*. C£X, *t>KX&Att. &gftffl*:r-**3l$ 

te*XKCPU2 0 2±TSbf^bTV^V7h^X7IC 
ioTtlSStl, 5X*W*7£'fe?P'-*#-P2 2 
0 y n -fe y U- 2 2 4 C J: o X^frS ft* £ 

(XHJ-AA) HU fa©ff}H*XHJ-A UhU-A 
B) J:Dfc*Vi«*«Tftfff*. ♦❖XHJ-A** 
M4+a-I^XhRAM2 0 3 (SI) 

/Sn?SIDSXS*U 7^U^r-: ^3>©*fr+tt*X h 
2 0 3©*8**UfcB£2tl*. ft*^tt*XHRA 

M20 sofiS^^uaiafciMAsnssLt^ffti/ 

<, ^x>li#d7 , ti4r^^2 2 4*^7Hl^XA^ 



44 



[0 0 8 6] H8tt, *XhRAM2 0 3tlI»|ft*n 

XV>£2^09X MJ~AAchBCD7*-Ty 

T&£o X hU-AAihB-tn-€ r n©7*"V-/ M4# 

[0 0 8 7] 3^nt7ii-2 2 4 llfcttSffiWcC^fr^ 

* AX HJ-A£BX h'J-A0 2^*M8X MJ 
-A 

X££ 

*£^e>*£>XHJ-A£ fnytn LX, XHJ-A 

* t>fr<D7s h U-A**ax*oXt>AVi 

* £%6*>0X h U-A*t*Jffl^«T*oXfeAV4 
*£-6S*>©XMJ-Att* r*-A7«; 

20 bXUfcltntf, *©*^-©^ffi rrt-A^y 
7*J bXV>5J;5&f^£^XV>X ! k&V> 
*#*^tt3 2 tf^ h©l-3ro«tt-r*J;3fc I"-* 

* &tfr^r KM* > * y 7 b ^>#^HfT * # ih S -fr* 3 - F 
*WbXV>Xfcfin 

tC, *^**635^i;»7xy5 1 bXfefiV^ 
A<fr*MMB2 3 5»4, 37 , n-fe:^^2 2 4©^*©*ff 
fcfflSfrSfc*^ &Bfcl$fc*XhRAM2 0 3**6 

3 5 ttffM*©tt#*ffV^ C B u s 2 3 1 ft^bX^S? 

[0 0 8 8] H9tt, ♦ftMM^ 3 5T*fFr*ifrfr 

4O0tft^f-?2 7 6-2 7 9*>&J5fc5. 
IlXf^2 7 6TH ft^XKJ-Afc^^T** 

40 >^^X*S«'&tC«, ^7i7flT2 7 7, 
«*lfcSrflC*frL2 7 8, l^v ? X^$Sifr^2 7 
9. 

3. 5 gftC97^7V^XHJ-A<Z)ife£ 
SlX<r-5?Xtt* 2o©Xxy7 , £Sfrbfttfnfc£& 

2. ^©♦*xt-U-A*Jkfc7x^^*)fcfoifefe 5 



( 24 ) 
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bTU£A^5A^ 

4. A&&M3BC9X MJ-AA*n^ LT^£A^3 
A^ 

1>5A*£?A* 

«TI^*T»3- Kfct ±>E;i/-;W;:*^TtfH*A* 
^>tM >yUT^5A>£5A>*fcj£?-£7;t':*UXA 
*Sl/fcbot»5. £©7JU:*yXAWU 8E»a>tt* 

[0 0 8 9] if X^-^-FTft^ HflH^-HT 
*0, /Vf/XX^Er-KTfcfc^ iSftW^-PT*^ 

i f AXHJ-AA*liJffi^-KT£>tK AO TAX h 
U-A©^-ir>X#*&*flc±'k *5W4AXHJ- 
Afc#<frA*#frf£j 

else ^(^>x-r >^LT^&V> 
end if 

else if BXMJ-AA*Dy£3ttTV>Tflcjt 

i f BXMJ-A*«i*»*-FT*?K AO rBX h 
U-A©^-dr>x#W*±+, MWlBXh'J- 

else ^&^>x>f >^bTV>&V> 
end if 

else /*XHJ-A**n?irSftTV>&V>*/ 
i f AXhU-A3W»*-HT*±+TftV^ AO 
rAXHJ-A(D:>->y>X##Atftlk<K **A*KA 

else #*tt'<>T J -f >^UTV>fcV> 
end i f 
end i f 

else /*< >^7x-^«W«36«aaftbTV^^ 
*/ 

end if 

Mffff 2 3 5^>f^f >^£A*J?.oA^£t l"xtf 
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1. £*>6A^<DX HJ-AA<Dyi?£tlT^£A> 

2. AtB©^hU-AK4:©«j6ai*«(t4SnT* 
0, »atmffbfc#^XhU-AJat # '&5T»-6A^ 

3. <!f££AM0X MJ-AA*SBftLTV>5A> 

4. £^£>A^CQX HJ— AA*^>r-< >y^*^TUT 
V^A^ 

'J-A*ft^T€>A^«:SbT^S. 
10 [0 0 9 1] if AXhU-AA*ny£Sn-0>5 

else if BX HJ-AA*Uy££ftTl>£ 
&(Z>X MJ-AttB 

else /*£5S©XHJ-Afen^snTV>& 

i f AXhU-AA^fj^-H, AO TAXh»J-A 

^A*#fiE"T5j , AO TBXhU-AA^iti^-F^ 
rBXhU— A©^— ^>X#*tA*(*lh+, *SV>ttB 
20 X HJ -Afcft^A«#fiEj j b&tf n«, *©X h U — 
AHA 

else if BXhU-AA***^-^, AO TB 
XHJ-A©S'-y>X#^A*ttjt'K *£V>ttBXh 
U-AtC^>x^>^^A^aE-r^J , AO TAXh 
U-AAHSEft*:- HT, TAX h U-AOv-y >X# 
#**flcjh+, *S^ttAXhU-AlC*^A«fiFfej J t 

^©XhU-AttB 
else /*£^6©XhU-Afc^A*#£Lfcl> 
*/ 

30 if pri=0 /*AliS, Bft*/ 
SfcODX HJ— AHA 

else if pri = l /* Aft, Bift*/ 
&©X HJ— AttB 

else if pr i=2 or 3 /*5#>F 
Dtf>*/ 

i f &«©XHJ-AA*A 
*©XHJ-AttB 
else 

40 end if 

end i f 

end i f 

end if 

3. 6 S«ECD7^x>f ^MJ-A«)7xyf<|r* 
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47 



48 



3 5$07]/7 xy?rty7Tft\Zft&rZ>t&£\Z\** 
*^fW»«2 3 5 H^*7iyfLSK 
[0 0 9 2] JKTO*ft**SlfcSnS«!:€rt^ ^l^7x 

1. yi/7x7^777^Tfc^) 

7 o l/7x^^/\'7 7 7C0rt^C0W5a , tt^, i c_s t a 

Kfy httft^07 r U7x^5 1 A^5?)bfc(6^-by h£ 
ft£. &*5, ft^M»«2 3 5<D\^t£Z>Vi?7,9^<D 

3. 7 *ff^ 

fcMM***J|U **ft*fr**:fc»fc3:/n*y'*2 
2 40US?X* 2 2 9*«^-r«». 
[0 0 9 3] 7X^H«3 7 r D-fety*2 2 4fc*ViTffl 
V>£,n£lft^7*-Ty Mi* ^©tric^hCP 
U2 0 2*>S0^tC«fcoTHfT3*l, *XM:S1/T 

2 0 3l:^tl, BlOPCI/U2 0 6^lt3 
:/D*y*2 2 4K*E£3n$Jfc», *MHiT?£**:tf 
/hl!fbr^#Tr»S. »*b<H, nyut7D-2 2 4 

S 2: 



^. Sic, 37 p Dir^-9'2 2 4fc*ViTHff3ns^ 

\z, rnfiWTfc^j fflSi::#bTfc:3 7a-feylJ*2 2 
4©ftfP(C*fLTffl^Vi»I»t*7. h v^xA**??*.* 

10 [0 0 9 4] HI Ote, -tn-en**3 2 h'y hc087- 
F*>Sj5fc£¥— lnM*2 8 0 7*-7y h£^LT^£<, 
&#frte, K 0*73- F) 2 8 1, 

H OfflgiJ fc^f 5 > H * 5 WMS** -f ^7- * 7 
-F2 8 2&31T. 3:3CO:*^>h*A, B, C©7H 
1/7,283-285 %, ifgHT H I/X 2 8 6 1 1 h\Z<5 
f.\Z, ««2 8 7fc, *XhCPU2 0 2*«ffl 

[0 0 9 5] B 1 1 tt, F 2 8 1 (9 

$jS2 9 0**bfcBT?*5. «^*^3-HH3 2lf 
20 y hgT, *^3-H2 9 1, «^yn-H2 9 2, 
-i>?y7h (I) tfyh 2 9 3, (Pd) bT 

yb2 9 4, l^X^ft (R) Hyh2 9 5, 

(L) t: y h 2 9 6 , 6^ 2 9 7 AH* 7- H 2 

[0 0 9 6] ^T^-t^ 
[0 0 9 7] 
[«2] 





m 


major opcode [3_0] 






0: Reserved 




i: ttimfiflH®** 




2: JPEG 




3: fr^SU? 




4: ®«3>#;l— 




5: £iftXtt 




6: 7*-?m<t 




7: A-7h-> 




8: mmmmm 












ii: titftmnm 




12: -&« 




13: 




14: Reserved 




15: Reserved 


minor opcode 

I7j)] aw-ya-h 


C tf>7 -f - A H OiWfctt major 






( 26 ) 


W 


49 




50 


I 








0= ^TfffclGtotffjh&L 





1 1 -8 5 9 6 9 



[00^8 





mm 


pd 


1 = "partial decode" $$§£&ffl-f 3 
0= "partial decode" t&Hi£®ffl 


R 


1 = Pixel Organizer's Atljk V i?X $ (pojen) \Z J; 

^* 


L 


1 = Z. CO^T. h U — A(A or B)«#£>ffr£ ©fcfe 
flocked" ^tlS 

0 = Z. <D^7. b U — £(A or B)tt^©^©/cfe 
lC"locked"£*'L£j:U 


length [15..01 





[0 0 9 9] I tf ^ h7-f— JUF 2 9 3 h~?%>C 

y7°h£ ft flat T & i: o Kffrfr £ 3 - H lkTZ> Z. t 
#6, &45, UGH 57 Mi r^j»7-f>^5^ 
H fclfttn*. -»SE^tyh2 9 4tt, -Wtt^bf 
y h2 9 4©h:y hifi'tey hUtl, i c„c f g WTs 

2 9 6tt, JM&K*fcD loK±©tfl^6ifcSi-r*«! ! 

3ft££, ^3WTUfc»*r*©#*3JfWCX h u 

-A^&7xy^£ft£. ft*7-f-;PF2 9 7Hft* 



30 



1 6h*y bfiTSS. 6 4, 0 0 O^gJeU^A^x- 
2 9 5**-ky H2 0tf^-fe;P*-^>f1f 2 4 
6^po„l enl/^W^AMm MSttl' 

[0 10 0] 01 0fc*v>T, fcSfc^fc&Bfc*^? 
>F2 8 3-2 8 6©fttt»V^#*^-f ^fc«i;T?I 
aT®*tt, ***** y^tC*^5>H 

[oioi] *^5>k**;/ 

[01 0 2] 
[S3] 



51 
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$ 3: *U>F<D9<17 



1 1 - 8 5 9 6 9 
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toft 


Length defined by 


#of 
operands 


Compositiag('a^) 


input 


pixels 


3 




input 


pixels 


2 


JPEG W^/EBSS 


input 


bytes 


2 




input 


bytes 


2 




output 


bytes 


2 




input 


pixels 


2 


Halftoning (iBgSI) 


input 


pixels, bytes 


2 


Memory Copying (/^U^tf-) 


input 


pixels, bytes 


1 




input 


pixels, bytes 


lor2 




fixed 


fixed 


2 




fixed 


fixed 


4 



[0103] H12H 3*^5>Hft^iC»*r«Hl 

-F7*-vy h3 0 Ofc, 2*^5>h*A^fc*rt£ 
x-^7-F7*- h 3 0 1 t^LW§. 



[0 10 4] ^5>h*E3fi ; f 

[0 10 5] 

»4] 
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KM 


what 


to lM^-h#liW4^ttM^W*. 

w _>-_^ 3 _ h * = 12-13 :^7>KC3b^h7^« 

±^-^3— K =14-15: ^Pft 

3:-«X-$ 


L 


o=StviS}Sir— £ 

1 = &l> : -f — S^CDtK^ 


if 




S 




C 




P 




bo&0] 






[0 10 6] ±^®«ic*v^t, -^f-^rFwt 

©Jg-g-Ckt 3^at7-9-2 2 4^<^«7-i'6 

B A**'* 7 > F 7 - h* 4" tc#ftf 5 £ t 5 . 

t0 1 0 7] SI OIC^WT. -W£n©:*^7>F/ 
^r;_i<2 8 3 -2 8 6 FB#©Mfc5 



2 jUtti jiH^'i 




**^3-H (011) 
40 ttX5-*^T-5. Jfcfc, CBu s 2 3 1 £:frLT« 
*©l/57^>*K5fe-r5Ctt:J;t), ^»32 3 5 

[0109] &^i^-;p©i^x^ttiWiM;:&i;T^ 

flour 




"4ft' 



( 29 

55 

9<i -f c o n f i g 2 ) ttT^TO^y 

S-^-f 7 c o n t r o 1 1 V 10 

3o Control 7HWa&t'«* 

l/^^©»n©^>f^ control 2) 

[0 111] Mk<bWJ7s99<-f M>*7?buy 
7s9) \t. U^^^^V^-MZ^XWZ^y 
hStl, ir-y h2tt£h> Mr r l j 

[0 112] 2 4©» ; &ya-;H4, ft 

^SUfT+TTfcf^-ttlB^t^fctt, CBus2 3 1± 
<E>c_a c t i v e :7-f >£iry hf <5« ZL<Dfz£>, ft 
<frft![flHB2 35 a, CBus23 1 jKDtti? jl- 
f,C!)c_ac t i ve7^>© TORj £tD, ft^nrt* 

tiffl*:V3L-)V2 3 6 tHa-f >^7x-Xl6ll ; t^ 30 
— 3 7 fctt. *-rtyy7ftit$:nf7?Z>Z.£ftT! 

*-/X5y^*^**ffr*IRtiBiW*c_ba 
ckground^^l^. ;^5y^ 

[oil 3]-*-;t7y^n-*^DMAft*©*ff* 

2 3 5 3Sf»|lK:*-/X?yy#**«*ffSnTV^*S3 40 

oWE-FfcfcoTV>n«, t|r*«*«2 3 5fitT< 
SB2 3 7^D-*;W ; EU*lfSI«2 3 6*Mrictft** 



#B8¥ 1 1 - 8 5 9 6 9 
56 

ffi-sf £ffl;rr5. r*-A5^yft^»7j 
[0114] fitftifiQim<tiz>t* ft^awffiaaaE© 

^<£ftfrea, ^07i7f, m^cto^ftfr^ 

2. 3ffi^4»©«^#^*>:/**Tfcv> 
4. ffitc^>x^r >^bTv^ft^#fi-r^> 

/t^^rswaic-rs. ££Sreiiyi*iii»st, ft^ 
$nra2 3 5ttSft*fr+©#«*^7i"s*THfflfe 

-TSilt*^<, Sttft^OlR7*CBu s 2 3 1±© 
c active tc b ackgroundM>& 

3. 8 ft^«d»BB©l^yx^H©3R 
ft^jWR7T5t, ft^M1BHB2 3 5tt*fbV^«tt*Jg: 

1. jB«jai/S?X^7^-feXtV7*7©A*. t77 
*7**A*IH««B2 3 5©^«©x--$?x>hfc.fcoT 

2. a«ai/^x^©HHf, ft^a«*^^>^ft^ 

Tfcl>»£fctt. ***^f (i c_ i pat i c_ 
ipb) 6**©U-fX»*ttS«. 5?ir>y*^© 

-5c fcl, v-^>XS^^fi : e- FT*nH**7 
l^X* (i c_f nat i c„fnb) ttftflrfSZ: 

[0 115] ttttl^Xjr (i c_ s t a t) fej&rbV* 



( 30 ) 



1 1 - 8 5 9 6 9 
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3. lZUyZV-iZJmffl. CBu s 2 3 l±lCc_ 

Tftt>^ «S7U>?X^ ( i c_f n at i c_.f 
nb! - ; - ' >^7/Fy-T>7^ , _ 

b. *7Ufc*^*«*7«rj«T<>^7yh"r*J:5lc 

3. 9 I/^X^7^irXi?V7*7©irv>x^ y£ 
t^X^7^-feX-trV7*7&, **®*^IWI^^ 

1. tf^TtW >^ l/vX^ 7 (ic ipatic_ip 

b) 
2, 
3. 
4. 



58 



10 



20 



ToDoI/vX^ (ic__tda£ic_tdb) 
*S7l^> ? X^ (ic_fna^ic_fnb) 
-f yZUYWTs* (ic_intai:ic_i 
n t b) 

5. mtiLV*J7s9*Q>tii±}iy V ( i c_c f g) 
ftfflx.-i?x> h\t?^TOUi?X*£^^b£±\Z 
^^.h^T^^o Sfc, JMJx-SJx^httT^T© 

p^x^fcvo-cfe»*atfc:t««T*«^ 30 

952 3 5*^nS©l/^X^t©tt*3E«fUTlx*t>a 
V7*7£A#L&Wxfcm£fcV>o 4r«fHfftfa, k 

ft, ft&mmm 2 3 s -A&*mrr*fc»£ 1 * o 

£0 

[0 116] *ftfc©J;3fc, 5/-fr>X«#W**ft*- 
KTfctl«, #**Ktt3 2 tf? h© r>/-^>x# 40 

JDLTV>#, 0xFFFFFFFF*6OxOO0OO 

0 0 tf>ysn£. 

[0 117] ^«^6©»*ii*3V>f >^9^M/^7, 
^ (ic„intatic_intb) \Zte£tlZ>t, 

: ^©M*&^ in 



JR7J fluffy h ( i c_s t a t +©a_p r i me 
d£b__p r i me d t> h) hf^^tT r-> 

»t-/t7^7^y-^r>7,|§ ( i c _ 1 o .1 
^5l-M-4 i c_ 1 o . l/yZ^ 7 ^: -(iS) <tPH-- /in 
tt\ ift^fHPffita i c_s t a tl/y^^tOa.o 1 
__pr imed$)^^Hb_o l_pr imedh'y h 

3. >^ 7 / b y»y >Xf§ W7y-^r >X#§ 

4. <>^5^h^-ir>^#f*«7^>7#t 
J:0^> r/h$<j .■S8^hU-Ai:*v^T*-A7y 

n\mr^^r^t\zuo, -o$y7vmm\*m 

[0 118] W^x-yx/hH ttfiBU'^X^*© 

-Y >^77h*Pflifc:y h (a primed, a ol 

primed, b primed, b ol pri 

medtfyh) *t7ht5itm> <>?y7h 

3.10 tfH^fWSP S$ 

si 3"n, ^mmm2 3 5*±di*wc*l&h-c* 
*«*flMiJ2 3 5tt, *^tff*-f^;nfe»ab3 

yDt7t2 2 4 ©^*©*ff 

SB 3 0 5**0. SfTffffiSB3 0 5ft **Mffffi2 3 

^g^S§3 0 6£fifx_5o ft*ft*83 0 6tt, 7k7 

**fl»B3 0 6HU ^©nyniryiHEv^ 

5o 7k7xy^Ay7r$4fflS$3 0 714, /l/7iy 
9 L /1y7r««HB+©7k7x^9 l /t^7r36^©a* 

a*^**a***3i'r«tt , bfc. ^«^S3 o 6 

tAW>^7i-XX^f7f2 5 2 (02) £©ffl© 
^>^7x»X^fetmo ^7x7^7 
7r«««3 0 7»4-t?©*^#<>^l/^X^ (ic 

_i pat i c_i t>t>) ©Etfftfeiffe-r*.* 

#2 3 5, ^©4s?i-M&§ 
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XB^fflTOMfSftfT-S TCBusj H#«B 3 0 8 
V^TfTfcnS. I*HCBu s 2 3 ll:J;oTl^OT 

[0119] H14H, Bl 3 0^fT$ffl^3 0 5 £J; 

1. *©*^*^O^XhU-A^6Sl0ttl-r^*:ft 

2. SiMM^^x^feBBteU 10 

5. *M*©K7*ft5£U 

6. ft^*7l/fc6py7^*Ii«. 

[0120] mnrnmrnt, ^#©#^ff*-f 

*) Hi 5(4, ±ifi©**2SfTlK£;i'** 

it^«3 1 0«»a»H**b»:H7?*«>. HI 

mmzftv. hi 5©a»Hfc*v^TB8*6 3 1 2&<&tf 

C^7iyf3 1 3tt«tt, £<0««*fflV>TlflM*& 
A^f£o HI 4©l^X$M?g«3 1 7(4, WSOtit 

S&gfcfr?* *fc, JUTl^^X^ (ic.fnaii 
c_fnb) t^(>9y^VV^7.9 (ic_inta 
t i c_i n t b) fc*JfcttU F->— 5r>x#*K 30 

fc, I/$?X*««*3 1 7jWt3. sec v^7,^m 
m 1 7tt-f ^^MMMfflMrS. *-A5y^ 
**«3 1 814, i c_s t a t l/^X^+Oitffl&tt 

©W*fr5. 3Wr**P*tt, Et+**3 1 OfcHl 

3©<MWIS3 0 6<h<Z>H<ZH >^7x-X£fr<5« 
*t-f>^7x-XSI$3 1 9£«;L£. 

to 1 2 1] HI 614, **tt#tt3 0 6^J:r)»lc 
^bfcHT*^o *^#»i:i:/n*vtf***l,T 40 
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-^r>lJ-3 2 1 ^v-^r>it3 2 1 tt, & 

y^*3 1 2ta«r<5. 5r>* 3 

2 1(4, #^©WSStt^*^©*--/ , «7y7 r ttSfti:© 

[0 12 2] HI 7(4. HI eco^x-^X/t^^^v 
-^r>^3 2 i£J:9f¥»c^bfcHT£>6 0 

XAy^-\->-^>1t3 2 1 14, ±fc<Dz/- J T>7,Um 

«sbv-»3 2 4 tmmbrz=t^^-)mm&^-'r> 

•y-««V5/> («;Ui3 2 5^3 2 6) £fix.£o 
^-;U^«^^-^>^^T5>>t4«^T^#&^v 

«l7y> (MIJ3 2 5) 14, CBusf^XA'^ 
■HC^ttOCBu s &$misTm*<D\sz?7>* : £'tey h 

5V< ^Dyn^^5>^S<h^V"^>-tf3 2 1 
[0 12 3] H&iCte, fJM**-^— Jr>-*3 2 1 tt* 

?3.-;WBrt«Wsft» tf^ir;^-^f^if2 4 6 (p 
O) , T-^tr>i«filHS2 4 0 (dcc) , ^ 
7>H^-^-<ifB2 4 7 (OOB) , *^7>F*- 

If C2 4 8 (OOC) , ±f-^/U2 4 2 
(MDP) , »**-^-fif2 4 9 (RO) , JPE 
GX>3-y2 4 1 (JC) tt^S^- 
JUfci'bfciRbTH*. &43, 7X-XMB 
$2 3 8 (EIC) . n-*^*U*W«2 3 6 (L 

mo , nrtmm&2 3 5 (i o , km >^7 

x-^7f2 5 2 (IIS), m^^=L-)V (M 

M) fcgO^a-Mk ♦*«w«4 , fctt*lj*;Stt 
[0124] **?*-)Vtt±tT0# 

[0 12 5] 
[*5] 
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Jr>X ID 




po nrr priv* npir* k/tm> do 

r\A UK^-l*, Ijwd, IvllJr, KU 


1 


LoL 


ru, ULL, UUd, UIX-, MDr, R.O 


2 

2 




ru, UUd, UUC, JvJ, RU 


3 


ij'.tj cooing v j y vt^j 

it) 


ru, ^./laJ, UUli, i , JC, R( 


3 




PO, DCQ OOB, 00C MDP, RO 


2 | 




PO, DCQ OOB, 00C, MDP, RO 




Halftoning (PSaft) 


PO,DOC,0OB,MDP,R0 


4 






8 


DMA 


PIC 


5 




PO f DOC, OOB, OOQ MDP, RO 


6 




PO, DCC, OOB, OOC, MDP, RO 


4 




R0,RO,R0,RO 


7 









[0 12 6] B17fc43l>T\ ft^^jL-^fiK^-dr 
(W*.«3 2 5) tt^K^W^^T^HrXffia* 

[0 1 2 7J HI 8HU ioatfctEoTWaTfi^E^a 

B»H3 3 0-C*L&HT»*. frES^-JMMSS' 

>yr*r§fc^{C, CBusf-fX/Vyft*SfflbT, 

[0 12 8] HI 911. H13©^J7i*;fAy7r 
H»^3 0 74i:0^lC^UfcHT*^ ^J7xy 
^/Xy 7r«W»tt*-©3^UtylNW' (6X32 

3 3 5*11*.*. -tLT, yj7i^f/b77«IB 
u s >■» 3 3 6 fc •t^TftWStl*-' 

ftfrMMk **tlt!P*CB 

^-h£ffi£3o IBus^>*3 3 6ll, yU7 

*7xyft-5fc6l:7 Kl/X*4j*T*7Hl'X«3 
«3 3 7ftfc«fc*. 7Hl/*ffI«3 3 7tt, i c„ 



flEt, ic_ipatic_ip b l/vX^lCv-f >7 P 

3 3 9te:/[/7xy^W7 7$1SP8C3 0 7 05^fl£©Sl 
3. 1 1 ^a-iPD-^;H/yX^77<;«i 

hi 3\z^kLtz£?\z, ^um^y^-)vs»i^ts 

fi-^^a-JKi, H2 OfcjRLT**CBu sf>^7 
x-X«ra3 0 3 tit>fc±3fibfcl/^X^3 0 40 
30 rtfflHry hfcffi!*., CBu sftft*3tt#tt*ifcfcfc 

; E5?j.-^OftWtt, CBu s< >^7x-X3 0 2$: 
:frbT*5*x-;i'*<D 1/^X^3 0 4fc»*attJlifc 
.fcoTfrfetlfc. CBusilI**3 0 8 (Hi 3) 
#^K»»2 3 5, W>?7x-ABM $S£* 
^i-W^tD^^i-^CB u s CBu 

[0 12 9] H2 0tt, ft^^jL-^K^ViTfl^fetl 
40 ^ C B u s >^ 7 x-A 3 0 3 «i^*SLfcg 
glipCBu s-f >^7x»X3 0 3«CBu s 
3 0 2*6©«*fflb«d«>«*i&*B**fi«T*4: 

3 4 l£^bTH^£ft£l^X;*7 7-Ol'3 0 4£{| 
AS. MIC, ^^UIWROttJSLfllUft^tHJ-^ya- 

TV>£« fe¥CBu s-T>^7x-X3 b 3teciBu s 
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[0130] T c _rese t J ft ^ 3 4 5ttS*CB 
u s-T>^7x-X 1 0 3rt©^T©l/vX^£x7 
XJVhftmtty hT3« l^U r c _resetj 
It tiCBu s VTsf-tOfflOlS^O^ DMD&Pl 
at^l7y>liUt>;hl^K -t©fc»* T c _ 
r e s e t j #CBu s fflS4M£i£ffl;*ftfc£ ttfe, 

c_err = ^ error[i j AND err_mask[i] 

em>r[i]no1 reserved 

[»2] 10 [0 13 3] 

c_ int « ^ intemipt[i] AND int _ mask[i] 

inl errupt[ j ] not reserved 

[*3] [0134] 
c_ exp = ^ exception[i] AND exp_ mask[i] 

[i] not reserved 



intj 3 4 7, Tc_expj 3 4 8, Tc_e r 
rj 3 4 9<i*tH HTOiCSJ^Ttya-Jk r 
r__int<herr_int_en WT^^ <D\H%-& 0 

[0 13 1] 

[0 13 2] 

(1) 



(2) 



(3) 



{g^ Tc s d a t a inj t l"c sval i d 

inj 3 4 511 ^^a-M^tT^fJiDtya-;!/ 
^6Ot-^/WM§T?*0, ffi^ r c „s data 
__ou tj t r c _s va 1 i d_ou t J 3 5 0 H 20 

[0 13 5] g|2pCBu s-f >^7x~X3 0 3 ©«Sf£ 
£LXtt«T©<b©**&*tt5. 

3. fXh ; E-K^ffll//i^MfI 

4. *^5?j.-*oea/«f»a 

3. 12 l/5?^^K*HJU/»#a**l 

g¥CBu s< >*7x-*3 0 3fctCBu s ±fc«Etl 30 

ft5tff#W-45. WPCBus-f>^7x-^3WRI'r* 
C B u s tit^t LT&TO 2 mm$>Z>* 
1. 9< 7A 

-X3 0 3rt©W?X*fc:U 2, 3, 4/HFSM 

>r -;m*^n-en rioooj t rioou 40 

5. ©JpCBu s-f >^7x-X3 0 3»4#**a#b 

WHUIMFCMU S*PCBu sO?7x-X3 0 3 
H CBusMO TregJ 7-f -\JH«*JliV>X2© 
W?X*tttfjfc T c _s d a t aj 3 5 0 

***ai*r*. #*&*»ftnrfct feme b u s 





>^7x-X(ix— ^*HfiHWr(c r c _sval i 
dj 3 5 0£3ifi!-r3. »*&*»ffc&«7'fS£* 0 
2pCBu s-f >^7x-X 3 0 3 \$ r c _sval i 
dj 3 5 0£j£fiiLTjg£-r£o 
2. 

?i-;wie-; h*-6vwi*sktfy h»s&trtwp* 

[0 13 6] g$CBu s-T>^7x-X3 0 3 

V>5**H^£. Tregj Tbytej Ten 

ab 1 ej 7-f-JH«ft«*LX, *K&-f ^-^;WI 

7 -^1***0 IB 

3. 13 **y«^»fflu/»*&**s 

g?CBu s-f >^7x-X3 0 3&CBu s J:©** 

u/»*&#»**3W»w-*£, g¥CBus^f>^ 

7x-X3 0 3f4B*^^j.-;i^©7Hl^Xft»UT 
^LT> ♦*©7Hl/X7^-;VH* 
«*rr£££X, I¥CBus>f>^7x-Xli» 

a-;i^©7 H h n-^fl-^ 3 4 4 

«#&*«fFCHt, g¥CBus^f^7x-X 

[0 13 7] gfCBu s-f >^7x-X3 0 3 ©»# 
H Cfeu s 3 0 2 J:©CBii s#^©£^:/7^-^ 
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77^3 0 4«hffl*-trl^*3 5 3t:SW^-y 
2(ZJ:oT©J®!$n^ 0 CBu s^^v^x^^^aj 
*)-fel^*3 5 3 £i-*-7MMcU r c _s da t a 

SIM 3 5 2v;^yX^7r^iU 0 4&-f*-7)UZ 
^ Uxg7©U~F/7< FT&ttfcf, tt*ffiL/S 10 

-r;u3 0 4tt, i^yx^a*?a#«B3 s s, mti^u^ 

^ 3 5 3, -f>^^7 P h 3 5 6, 17-3 5 7, M^3 

astt3^5«i*sn*. i^x 3 5 5n at 

*fflb/»£&*lftHMS3 5 2*>S<0flHt r r e f_e 
nj (l/^*7r-fK*-^M T w r i t ej 
fregj *«*|U *S^^X^*-f*-^;HcT« 20 

^ 3 5 3 H, K*lHU/«€fa*«»«3 5 2*6©fg 

»(CiELV^yx^x--^S:iSRbc_s d a t e_o 

[0 13 8] «^**«3 5 6-3 5 9 teA^lCX? 
-*«*iasn&6WAx5-fl^ 3 4 7-3 

49, 3 6 2) #m*x5-€inr-r** 

[0 13 9] W?X*«3 6 0WU *5fc43V>TWy* 30 
3. 14 CBusM 
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C B u s «£JlT©-^©BWS:^r*r 



MifiOiiO, CBus (flJfflA'X) tt, &tv?a-j|/C[) 

1. S^ya-i^Mt^W/U 

2. RAM, FIFO, S-^Vi-^+COtt^lSffi^fc 
»©7£-feXAX 

CBus tefft^-7 Fl/X-x-^7 p n hai^ffl^ 

T, tya-Jh • --^ I . V&'^y ht : - : . '^tiZ^ 

SS*t 5 21 1 1: J; Oft* ©t^^-M^ 6 R AM* F 
[0 140] CBu sttHT©3o0iTSSaM;:«fcOffl 

1. *^*ffKFO***l«ffl2 3 5 (H2) 

2. ?-^7 h Ul/-^) Fmftff^frB* 
«K >^7x-Xf64#Pgff2 3 8 (H2) 

3. B u s >f 7i-x««i)SSnftlRi:ttn 

[0 14 1] «T©*MU #astt*««fc*ViTffiV^ 
©Kit LTV>5 CBus OCDjfcftSSbft'b© 

[0 14 2] CBusfffSK 

[0 14 3] 

[*6] 









c_iadf31:01 


source 




c valid 


soutcc 


CBus ifr^^f® 


cjKbdaBlsOl 






c svalid 


destination 




c_reset[15:0] 


source 


h 
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c_aciive[15:0] 


destination 




C_backgiX3und[15;0] 


destination 


^i^zJk—)V^^(0 background 7 
h -J- j 'j^-i V 


c intf 15*01 


dp sti Tuition 




c ciTorf 15:01 


destination 




c_repl,c req2 


EIC, external 




c_gntl, c_gnt2 


IC 




c end 


IC 




elk 


global 





[0 14 4] CBusCDc_i adflWH^T" 

1. c_i ad±TCBusft^7HW*I13n 
%>$t^<H7)V (c__v a 1 i di8) 

2. c_i ad (»#&#ftff0 ^c__s data (tt 

(c__v a 1 i d<£) 

fPtf)*S-t4, r-^U-f ?;P)W»7T5 *TK*IH UK 
ft<D?-ir*y h^x-Jl^c^s d a t affi#*B»l 

[0 14 5] H2 1 fc*wr, /Uli3 2\iy h<DAii 
-7KUX-x-^7^f-;Ph^^o ^C7)7^-^K 
ttKT0 3r>©*>f 7 (3 7 0-3 7 2) 
1. (3 7 0) t4* 3^n1r^i}-+©I/i? 

F p c i im ^;w£#£frbTv>£^«H >?7i- 



20 



30 



[0 14 6] ^nSOMMTCtt, ftHHK^;P©a»<D 

2. 9^^BWj^ (3 7 1) ttfcWt-KTfll^Sn, 

t-KPC I+M^£^frLT^S^gK^>*7x- 
X«»«^<BC Bus>f >^7x-XlC<toT*ricS 

fcft<, x-^iM^;W4c_s v a 1 i 'dffl#*fl!V> 

3. ^-ryCidfP (3 7 2) l^^a-Wl/^X^t 

W®2 3 lH«CBu8>f>^7i-XCJ:oTt* 

[0147] &^o^7'7^-;uh(4, siTomz 
ao-crerr* cbus «a*«F^t u/tfccDTr**. 

[0 14 8] CBu s%M$<17 

[0 14 9] 

«7] 



cJacUypeOti 






0000 


no-op 


A.B.C 


0001 


reserved 




0010 


peripheral interface write(£!2H > 


B 


0011 


peripheral interface iead(J322'f > 
*-7x~XKl$l0) 


B 


0100 


generic bus wrfte(ife&&$jA;*§& 
*> 


B 


0101 


generic bus read($§&»AXM 


B 
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0) 




0110 


local memory writc(D— 


B 


0111 


hxal memory read(D — f} Jl/J*^ 'J 


B 


1000 


register write( V $ 


A 


1001 


register read( V 9 ) 


A 


IOIG 


module memory writ^^is )V 


A 


1011 


module memory read^ "/a- )\'^ 


A 


1100 


test mode write(:rX h^^ F^ih 

*) 


A 


1101 


test mode rcad(xX b^— 
0) 


A 


1110 


bitset (try h-tr^ M 


C 


1111 


reserved 





[0 15 0] /Hh7^f-Jl/H(l Uz?X?*<D\iy h -TifB 2 4 7, X^y > rffrhf-tf C 2 4 8 -\<£>A 

*iry h-r5fc»J;:JHv>S*l*. ^a.-;^^ — ;H* A7t-v^hm d n 6 co*— j5f ^--f if tt. A^7 

\!tCBus}i<D&fi<D7\ t l'7>ft*i?a~)),&mfe?Z> 20 n«7tf™V*/hfc, J P E Gfr#{b»2 4 1 

£1-37^ -;i/KT*§, RAM, FIFO&£07K #:Mlf 2 4 9 **tti;/jrt«57 h£9rStt)#7 * 

Ma:^*^**fflv^nfct#ica#isnfcA-< [015 2] mfi«n?tt, ftm?-*?*—-?*/ m*3 

ii^lfyhf-?^. 4, 8, $>Z>W$1 6 htf>:/;W>&j80SLoTV> 

[0 15 1] MJfiOiD, CBu sttft^a-j^t 30 Sct^^a^X h U-A*6ric-5x-^© fAy ^7, h 

c t i ve5-f ■b;Hc*»snS7 ? -^* ££&£nfct:> 

ViTA^oiR7WF*»sc:t*«T€rs. CBus ST5lRfcJ8v>fc*i5. nyniryif-teU wwx 

TSc M background^>^ U-fey K X X>x>f 7>lf y h7 Fl/yS^yftJB^*. 02 3tt 

7-fcfti* -f >^77 r h*fr'5fc«e>OU-fey K X.y rty#XhV-&7*-VyhV>m-<DM$:ttLT^ 

3. 15 n^DiryiJ-x-^^-fytx-^f^ tfc2tfy ho^^^;po> ?**;n, s 1 **;!^© 

EG»tfflS2 4 l^x-^/^X©nyn*yiJ-4 , 0 40 -^E@**3 8 8T£>£« H2 4CD^^3 9 OTIJ, 

i»«4WB31»f^*«r*fc-r*fc»» n^D-fcytftt* &x-^*^x^h*i3 2tf7K , 7-H^U, 

«7*-Vy htrt«7*-Ty htSffiMflrfSx-- ^;i/^^:(C8 tfy h^TT* 4 Y 3 9 

j^tfJ^fl^*. ^BBx-^7^-Vy 5^$ntV^, H2 5©*HOW3 9 fcfy 

;M ; E>J-f>?7x-X^PC I ;Uft2©3^Dty h7KPA3 9 7 ri^^i;^**;^ fc K 8 t: y h £ 

iJ-©n«<>^7x-XKSfcftSx-^7*-Vty K Wnft^t^xi/ h3 9 6)Wft;£ttTV>*. fe 
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QtbT&ZtlT&t)* §7-HtO*-/Hh4 0 0' 
<0#**ffi testis. 

[o 1 r o ^fc^ffix-^*-^ H4 rfflj 

/N7V>^#{b*&t*C0*$&aS©X-^"T! t * 

[0 15 5] a7oi:yiH44o0rtffix-:^<7* 

T&D, »I8©3 2fc^ K7-H£I&^T47^t^7* 

hco3 2 hfy h7- KA^J5£S7*-Vy 
5. 02 7 1;:, 7-W4/Hh"r*5A7i7;Hh7 

[0 15 6] i2 8i3«©r-^^<^ rtr^-fe 

CO 3 2 tfy h7— H4 0 3 ^£j&£7;*-Vy 

[0 15 7] 0 2 9\Z^k'?&<DftffiT-??'f'7\t TJ 20 
>AW£/Vf hj 7t-V^h^D, ft 7-1**4-0 
©7£x>f 77VT h^*;P4 0 5 tHO©#7^f^ 

[0 15 8] ffi©rt»r-^*^x^ M4 Tffij 
i^7*-7yhtLTK»SW. **SB7*— ha> 

3 011 a^o*-»^-ftfKJ:oT*frStl*^«S7 
M 1 0*6X*7*-7y K4 1 1^©** 
#«feKbTV»*. 03 1(4, »**-jy^-f1f2 4 9 30 
K.fcoT*fT3ft3rt«:7*-vy M 1 2*6^«7 

[0 15 9] £CF, *l»**ffr*jB?I*J:D#IBfcK 

its. 

[0 16 0] SfA*r-^«7*-7y hA>&rt« 

7t-v^K©^ftT»5^ mz2\*&wm\z$> 

V>T@* CD *-iJ1-<i If IC «fc r> Tffl V> £ ft* ^JSfeS* 1/ 
T^5. (4D&t4^«$ffi7*-Ty K4 1 6T***t, 

\Z, ftffi7>rtyirrt<t h7*-7y h4 1 7lt 7> 40 
/^£jEjfi{fc4 1 8*froTrt«7W^^/1'f h£Pf 
Hft57*-7yh4 19*tlSt§. 7>/W^jE& 
ft4 18iIH n«7>/Xy^/t-f hXHJ-AjfcS 
#7*7V73/Vf h*ttDHK«8«r5. ^3 3(4 
7>/^^^i0fifc«31ft3Rb&"bC!)T?*S*^ 4A<h 

^^tfrrftx*©^* low h?**^©* 




S/Wh*HJ-A4 2 3l;:*S^*«a£fr3. 
f48hfyh<tirffifflsn«. 4lf7-*tS*A 

hWi^C^DlST^n^. XMJ-A4 2 2T 
V#-h$nz>Atl*7>>^Z hV-*(X\t, 1, 2, 

4, 8, 16t'*> ftft, dtlSH * 

f£0^^iiffl$n^vy;xA4 1 ^x-^^7*> ? x^ h-^7 

[0 16 2] B3 4(4, T**)VZt\Z (0 2 3 £>r- 
*7*-Vy h 3 8 6 iTt^ck^tr) 2h>bWT£3 
g^^M-T^x^ h»5iCDA;*rr-*4 2 2**A;*j£ 
nfct#©^y^jEjgft4 2 1 ©fil^SSbTV^. £±1 
Ax-^H/Vf h^V^t-T'V h4 2 3fc&^T 

[0 16 3] B3 2fci5V>T, b^ir^X h U - A(4^ 
/V;?jB14 2 5, 7>/b?fiI4 2 6, gift 

[0 16 4] 0 3 5ttJ%yi/Hm4 2 5©«*^Ufcfc 

ot, *fc#7^T--f^;u h?**^*nfcB»*>*L 

7h'J-A5!i^n^if^lTl^ 0 IP*,, 

hX HJ-A4 3 0**7-K££©47£t' 
h*ft57*-7y h4 3 lKJEKSft*. 
7>/V^i!li4 2 6J4e«A^^fflao£#OjD.aT 

03 6(4, /V^/M^h , J-A4 3 3*«7>/V^ 
£n*§S4 3 4«t»6ft*« : FS*bTV^. 

[0 16 5] 03 7H»3iaiR4 2 7»at5SbfcfcO 

t&d, N**fficrt©A*?ir*;i*t-r*t, a* 

^ftStt ryuh^-fytf^^j , #«;Ltf4 3 7££ 

rit-r*t*fc«v^6ti«. b^ir;i/^*;K4S 

/h/W ha>6«»Stl*. 03 8(4, #5t4 3 6©A* 
x-^««K*Wi«4 2 7 icfcoTfcifcStu 7 p n 
-f yt!^-t^7*-Ty h4 3 7*»4**tl*#^*iS 

[0 16 6] »*Wl3Wfton*t, S9RAW&94 4 
0 (03 2) jWrtoti*. 03 8 (4S^A#®S^ : F 
^^bfefe^T, Wr-^^X^4 4 1l:^n 
fc-£«TW««ftAtt#*.» «OJ:5fcHl*B*2 

4 2 ftferiE-r^w^eiSbTv^. 

[0 16 7] H3 2C^T, JQ.aS4 2 5, 5 2 6, 
4 4 0©tB^(4U->A7!yXJaa4 4 4fcSI6tlSo 
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[0 1 6 8] H3 2IC*3V^T, lx->*7 y 4 4 

©MS 4 4 6 d^ffJiC, 7jl/fJ-XhlRAM2 5 
[0 16 9] M1I4 4 6temiCx-^7*vx^ h 

10 

[0 17 1] ^4 2ti, m*rt«7*-V-/ h 3 8 3* 
5fflM«f7*-7?/ h 3 8 4fcx-**£ifrf -5«* 
;t-#±-f1f 2 4 9 0MS£^bfc ! fccE>T&£e "COM 
ITU H3 2»C3Rbfc**ffl31i:H«©«ia4 2 4, 
425, 426, 440 MS 4 5 0 TfttJftC 

B*#a*?4 5 1, *IE«{b4 5 2, ;Hh7H^» 
^4 5 3, f^M7X + >y4 5 4<2MiI£#/uTU 
B4 3fciKLfcI^#SRi&3i4 5 Itt, 03 7<S> 20 

B4 3TTWU A**©3'D©W»^ + * 

[0 17 2] H4 4t^bfc*iEfflft«LStt, 834t 
^iEfi{fc;ffla4 2 1 CD««R#©»fP*-r 

[0 17 3] 04 2O/Hh7H^»^ill4 5 3 30 

Oft/h 2 fc> h #7 ^ h U 

-AfctfjB-r*. h7 Kk-»^M314 5 3T?tt* 
*HB7>/^y^;X-f h*t«V^6tlTV^fcff (H4 

i*ifimtoZtlT\*&t*tt (04 6) , /W h7FW> 



>^^^l-;1/4 5 3fiffl** HJ-A0MJ&7 Kl/X 
*HS<0 J: 5 (CUV y yT^o 
[0 17 4] H4 2©S#a^VX*Ma4 5 4 *g|4' 

(#J*.«4 6 0) ^YX^miT*^, Mffi 

m r ) 

r). 
r) 

2. saE©^(cS^T3ya-fe^iJ-e#TK^Sti 
To rsj fcfy h**0T?»titt, nyn-byij-ax-^ 

[0 17 5] H4 8ttr-^*fPIxi?^^©7*-V^ 
h*»bftHTf»5, £lTO*tt, B4 8fc3RS;ft&l/ 

[0 17 6] T-ZWfcWTstVt-^y V 

[0 17 7] 

[*8] 



(39) 1MHM1-8 
75 76 
*8: x-*8ttUi?X$7;t— 1*jb 





2**PJ 


Is3 




Is2 




tel 




IsO 




suben 


\&<<*--})\, 


replicate 




wrmask 




cmsb 




normalize 


0= 1 \**jfcr-Wt29x&h 
1 =zeyh-r-ji*^S?x»h 
2 = 4tf <y h^— * tf-^i?x? h 
3=fle7hf-^^x^h 


bo 





P 


1=y*-/£XKI— A 


if 


























3=ft 





[0 17 8] 
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cc 




L 


1 = fit* : f r --*^<DJt*-1'>$ 


what 


rKwre— k 



**** 7©T"<T«*#fetffttrjSTrtt***«. 20 to 1 8 o] ntsn^r-^^-fy 
cn5©ia*--&t>-a-©-«o**tjft*o*«e**4^ Co i 8 i] 

■fa. #^fc#LT«tt;!^**^5>Kfcti*x- [*9] 



( 4i ) i -8 

79 80 
S 9: Expected Data Types 





^^>H A 


*^5>F B 


5T^7>H C 




* 




(Pixel 


(Operand 


(Operand 


(Result 




Organizer) 


Organizer B) 


Organizer C) 


Organizer) 




ps 


px 


ps 


px(T) 


ps 


ub 


px 


PS 










bl(B) 


ub 




ub 


ub 












const 








GCSC 


ps 


ift 


mcsc 


mcsc 


mcsc 


mcsc 








ift 




scsc 


scsc 


SCSC 


scsc 












(B) 


(B) , 


<B) 


(B) 






JPEG EEitSf 


ps 


pb 


ct 


et(B) 


et 


et(B) 


ub 


ps 




us 




(B) 




(B) 








JPEG 


ps 


pb 


fdt 


fdt 


fdt 


fdt 


pb 


ps 








sdt 


sdt 


sdt 


sdt 




ub 








(B) 


(B) 


(B) 


(B) 






rv»ta mriino 

1, Win MAIUJJC 


ps 




et 


et 


ct 


et 


px 


PS 




ub 


pb 


fdt 


fidt 


Hi 


fdt 


pb 


ub 






ab 


sdt 


sdt 


sdt 


sdt 


ub 










( B ) 


(B) 


(B) 


(B) 








skd 


skd 


U(B) 


k(B) 


it(B) 


it(B) 


px 


PS 


a* 


Ikd 


Ucd 












ub 




ps 


px 


mm 


mm 


mm 


mm 


f» 


ps 




ub 




(B) 


(B) 


(B) 






ub 




P« 


P* 


PS 










PS 


> 


ub 


pb 


ub 


pb 






pb 


ub 






ub 




ub 






ab 






PS 


px 










px 


PS 




ub 


pb 












ub 







nb 










ub 






ps 


px 


ps 


px 






px 






ub 


pb 


ub 


pb 






pb 


ub 






nb 




ub 






ub 






ps 


P* 










px 


PS 




ub 


pb 










pb 


ub 






ub 










ub 






















r*U^5T^-k 
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packed stream (/ 1 y 9 £ ttfc * b 'J - A) 


Pb 


packed bytes 0^7 ^ H tl/t/W h) 


ub 


unpackedbytcs (/W££tlT^&WW N) 


px 


pixels (0§^) 


bl 


blend 


const 


constant (^^C) 


mcsc 


4 output channei ( 4 ft ^ 1* ^ J W 


scsc 


1 output channel color conversion tabic ( 1 ^fhtl y — ^J&t"— 


tit 


Interval and Fraction tables (MMEOS^T—y M 


et 


JPEG encoding table (JPEG W^flSx— 


| fdt 


fast JPEG decoding table (SJI JPEG a#7—^;W 





slow JPEG decoding table JPEG 


skd 


short kernel descriptor (MV^^^^BBijrf') 


Ikd 


long kernel descriptor (Jl<^;fr— -^HBSlrf) 


mm 


matrix co-efficient table (HMi&ffcf— ^OW 


it 


image table Offex— 


(B) 




an 


^ 5 > K -T JUT? fe «fc Iri 




. ■- 1 -■- 1 r™™™- 1 



[0 18 5] 3. 16 x-^IESfclHlK 
H4 9MU 3aO*««^D5/^*#b3>tf3.-^y 
?7>r y^A7niry^£5SbTv>£ 0 3oco±tSffiy 
D'^B, \Zi7*)\f*'-iJi-'f*f2 4 6 fc*^5>K* 
-2f:MlfB* C2 4 7, 2 4 8+©5 f -*jE*ft;«l 30 
0 6 2, £x~*/1A 2 4 2&£V>(;U PE Gg&2 4 1 
COf^y^^^ >^XX>^X #1*$!fflgP2 3 5+0 
yn^7^>^X-yx>h 1 0 6 4m X-^IE 
£{b$l 0 6 2 t*&tfy7 4 yi7XX>>z?><DW)fc 
tt> /P^7$>yx~^x>hl 0 6 4^<0f8M*A h 
U-Al 0 6 4K:<i;oTife£$n5o fr**z:fcfc* 7 

Di/^5>i/x»i;x>M o 6 4ttflnwa«fttfv^ 

rtBBMWfl^ 1067 i" 1068 ftS/XrAtOtt©^ 

ny^lcffl^fSo &A^x-^7-K 1 0 6 9 £<!: 

fc, iE^fefflS 1 0 6 2\m^\zM^X^-^Oyir 40 

**y^7^y^xx>>>>i o 6 3ic3im-r^« 

[0 18 6] x-^iEfiflrtBtt, »*fclih!^-fe;U*- 
X-^*a«rCjEJBflSbfc«, JPEG:fS^ffc&£v>tt 



&ft^CDA^x-^7-Fl 069fe32tfy hifS^r 

tt*©7*-v? hT*oT*>i:V>. /^£»*A*X 
hU-Att, x-**:/^x*h*«l, 2, 4, 8, 1 

U-Att, 8 tfy hCD/Vf h0*#W»T*£J:3fc3 

Sh^-fe^x-* 1 l tt, ftW8 MBTJfeB 
$n§i;5^1 t 2, 3, 4fl©*JBI^ir*;W^JsS 
*. 

[0 1 8 81 ms Ott, x-^IEglffcS&l 0 6 2 0ft* 

«1 0 6 2HU FIFOA777 (FIFO) 107 
3, 3 2tfy J\A*W?A* (REG 1) , 3 2 b> h 
fflAl^X* (REG2) , j0B{bv;l^^l/^-!H 0 
7 5, IMMB1 0 7 6*>&jfc-5. A#x~^7-K 1 0 
6 9teF I FOl 0 7 3C#ttStlfc«> (REG1) 
1 0 7 4 tf ^T©A*ify KjWJrSttWJ7*-V? h 

&> ,i 67 4 m^wmMHmuu s * 
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3 2ffl^bitX-f^5 1 ^«lx.^o BP -5, JEMik-?)^ 
yi^m 0 7 5ttx [6 3. . 3 2] tx [31. . 
0] itiSn§2^3 2t*7 1 0 7 

7, 1 0 7 8 U^tt^o 

[0 18 9] ClCD.t'Sfe^SSfflV^^tT, WCfflM* 
fflSfci^TF I FO^M<^t)2"3^tir"^7 

7xy3 L r§*ffi(Ccfc5fc<0T*5. 0fSx-^7-H 
*5Wi*:/^x£ h**F I FOA^7r4 J ^P^-r^> 10 
A*x-^7- KlCt£/«*S^tt r5' v7°J e;ntV^ 

F I FOA^774 J CDPS'r-^7-K^^^g^^^ 

^a?- * 7 - h wjEmttM \zx-h znz^vfzm 

[0 19 0] fOTrate, RE G 1 1 0 7 4^REG2 
1 0 7 6 6HJW-f5<*-:/;Wf*tREG 1_EN 1 20 

0 8 0^REG2_EN [3. . 0] 10 8 1 
Z>ttb\Z, FIFO 1 0 7 S^iE^bV^^W^ 

1 0 7 5 ^WWTSft^**)**^. 
[0191] B4 9 ©yp^a >^x-^x> h 1 0 
6 4ttx-^jE£fbffil 0 6 2 IC»bT^©J:'5&«* 
fl^SSIffl-r*. FIFO_WR4«#, mM{k7y? 
[2. . 0] , fcfy h*7-fey hb [2. . 

01 , ^tW*^>hc [1. . 0] , *HB7*-V 

n = 0 

y [i] = x [7-b] 0^i^7tE>££ 
n = 1 <£>t§£\ 

y [ i] =x [7-b] i = 1, 

y [i] -x [6-b] i=0, 
n = 2©«£\ 



84 



yh (E) t^fcfl^T?**. A*T-*tt, 
— ^^SffiTS^Dy^U-^ ^^^ttCF I FO_WR 
ft^l 0 8 5£iMffi-f£££:trJ;(3, FIFO1073 
ffi«##iMa&v>££i;:fcL FIFO 
ttf i f o_f u 1 ltt^7^1 0 8 6 £2!tht-3o 
3 2t + ^hA^f-^^^n^«h, *Uffi7*-Vy 

(E=l) T*S^7>;iy^A-f h (E=0) rs*> 
^»bn§. E — 1 OOlg-gi;:^ IE^fb7 7^^- 
teAy^A hU-£©#glR-tM'X£&&. W*>* n = 
0(11 MB©BSi, n-lii2t:7rggS, n = 
2ii4t^hga*. n = 3tt8b:s/ h*OMi* n>3 

-f hO***WS*7f»-5fc!^'feJk c = 2ftS/h2/Wh 
**«»-C , *£fcf£-fe;k c = 3»4»/h3/M h**W«n? 
*Sfcf^-fe;k c = 0ttT^T©4yWh**WJ»T*5 

[0 19 2] /^£XHJ-A**8t:? HSETFiDSK 
rt^lSSJfr&Ktt, h*7-fey h*iREGlfc#!tt 
$nTViS«T*Sx [3 1. . 0] + G>x-£'*fl ! SBII 

feftg£&£-r£. h*7-fe^ F^ttUfeoA*;^ 

*/Why [7. . 0] ©t**ft»4ttT©iCT#>L5 
[0 19 3] 



3, 
2, 



5, 
4, 



7<Dt£ 
6<Dt% 



[3] 
[2] 
[1] 
[0] 
[7] 
[6] 
[5] 
[4] 



= x 

= X 
= X 

= x 



[7-b] 
[6-b] 
[5-b] 
[4-b] 
[3] 



= y 

= y [2] 
= y [1] 
= y [0] 
n = 3 <Z><§£\ 
y [i] =x [i] 

y [7. . . 0] =x [1 5. 
tl 5. . 8] , y t2 3. . 1 

2 4j fc4&+*AfeM*rc&*. 



0£ i ^7<Dt% 



8] 




( 
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Srffll-^S^-trtt, bfy h^7-ky h£A*A-f KcOfi 
ft. A*fi*«**«*ffl*«Ut±©i:#^tt, mas* 
H^ffl^l-^^ii^ckoT^^^flOo JiSTH 16 
I^O, 1 6 f'y HA»B*^0)STT8 t:y UBS 

mm^fctoKLT^s. io 

[0 1 95] i50 ©©JffllgSte n [ 2 . . 0] tc 
tl. . 0] G>«*fSfTV\ CtiStb [2. . 0] t 

G 1 ^>R E G 2 *-^JHI§&tfiJt5. £ 

MMttREG 1 tCAAT-^tlKnSft© 

tfy F&gi n_b i t [4. . 0] t. &fi?-?0 
t^&**te#£?Ift<BA*>r hfilou t__by t e 
[4. . 0] *e*T*# MffttMU 
fflg#»7U&B#jSn?» i n__b i t [4. . 0] ©ffi 20 

TA* 7 Steffi U FIFO^W.CVU^D^^ 

-tH^HCfc^TF I FO__RD{f^£j£ffl-f 

F I FOi^illM^PItet^ ff^ f i f o_em 

b_t r unc [2. . . 0] =0 

= b 
= b 
= b 

( r&j ttvyhztom&iBm*xrr) 

COkofcmWlZkQ, mS OICiSWTMUXO, MU 30 
XI. . . MUX 3 lTS£nTV>£ftjE&fcv;^:/ 
U^WMX^ «H«MBftfflVi&Vifc*©3 2-l 

0 - 1 £Tfl£M$tl5o C©*-f X«/hfcJ:DHWME 

[0198] ei±©j;5e» ff*ftjtt«Trtt, x-* 

3. 17 7£1:5l/-*#-F0B*ft«»fc 

02 £&2IC:fcl>T* AftHMB2 3 .5tt3^tHry* 40 
2 2 4lC*^THfrSft*»fffc(Mtaft-B#«* T|| 
frTSj c *fTS*l***«, £5*-*/ , tt«2 4 2fc 

[0 1 9 9] 3. 1 7. 1 
& 5 1 »i, ±x-*AXffi 2 4 2 (CfcViT*S^n^-& 
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pty, f i fo_ful lttFIFOtti77yf* 
0, F I FO^t^r-^^WlTt^^^^l: f i 
f o__emp ty=L F 1 FOfflJJl:' h£\Z f i f 
o_ fu] 1 = 1 V I F O R D 7) ! - ^ ui $ ti 'It 9 

UyZV^ZMZ&^T, REG 1 EN09iIffi£tl, 

IfrbV^x-^^REG 1 KJ&D&^n&o REG2^ 

0] , REG 1 fo<D&W&&*<D%%hWm&. REG2 

ic*v^T*ffiffl^v*;i/»^3^©ffl*T<7)fi/hffl*<h 

oilchT, REG 2 E N [3. . 0] £gtgt"5. E 

= 0 ©^iCte, REG l4»JCtt-^C0W«lB*l/^S 
ftbfcK REG2^fii6§5 1 t^WS§^tlfec 

[3. . o] tmh^m&iz, ^u&fiv-vm^ 
-tzmmzttm-rzctiz^?), h5oos«a<6»s 

h*IRMMfittIE»ft;:7 r * ^-Icft^S fectrc* D > 

[0 19 7] 

n^3CD*§-& 

[2. . . 0] n=0©|§ 

[2. . . 1] n = 1 

[2] &" 0 0" n = 2CD^£- 

> H 4 6 5 **tf. uuT, fi**»JHatt 

777K >>V>F\ tr^-fe;k *>f;^£*refcfi 

V>. 77^ Wl/>Ht:HLm A^/ffi* 

<&&fc#, ^1/>H4*«4 6 7 fc*V^Tfe*Stl*. 
Sir, A*x-^fc> *-*9>F5*-*4 6 5ftl«a 
ii-^MSx-^ 4 6 6 fcfc^tfo 
[0 2 0 0] »3fi©J:5K* S^fc^ir^-^teS^ 
tWl/W hWT?*5 4:3©^**;W)&>Sj5fc*. £ 

X TThomas Porter and Tom D 
uff" Compositing Digital I 
mages" in Computer Graphic 
s, volume 18, number 3, July 

1 9 8 4j fc2©«PB**#M3nfcV>. 
[0 2 0 1] 3yDt*;*tt^|/*|[f-^tiV^t 
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7U*J¥U 7umWtStltc&tl 4 7 2, 4 7 3 £#* [0 2 0 2] ^ftfP 

^t**T€r§, £f£S$4 7 5t4. M&<DfiritT-?\Z& [0 2 0 3] 







(a C o-»o)wer(b 0)( b 0 ) 


(aco^coO-^ao+boC 1 -^)) 






(a ct Ai) out ( b co b a) 


(a^l-b^l-b^) 


(aa»^ ato P( b co> b o) 


(acoVWl-aoJW 






(aco>a<>)P Ius O'coAi) 


(w^aco+bco-Kajj+bo- 




255)/255)+r(6lamp(a 0 +b| D )- 




255)/255 ) clflmp(a 0 +b a )) 


(*aA>> loadzem OW^ 


(0,0) 




0>coA>) 




(W'o) 


(»co.ao) I<wdoo (b OT> b^) 


(W>o) 



[0 2 0 4] CCT, (a co, ao) t4> fiact^ 20 

[0 2 0 5] #y>7/7y ¥>ifU4 7 611 fMI&fg 
0-2 5 5rtl;:x-^£^7>7\ £145 

3><D r7>l/U**J 4 7 7jQ.Sf 3£<!:fcT£, *> 
fcOK^Iifl^fcEmifcfeT**. tti^x- 30 

[0 2 0 6] i5 2H ^**L««fr3BK:±x-^/X 

j^*^i^-^H^©tt©#^*tawan*, p a 7^ 

-;H*H* S-C0r-^hU-A4 6 4 (05 1) * 

Pb7>f-;PH«m2^5 ; -^XhU-A4 6 5*7^ 
S^SifcMifcftSU Pr7^-*PUftt4 7 7 40 
fcJHV>TlS** r7>7l^l?J 
C7^-;PH»4*SH0-2 5 5rtlC7^^*SVW4^5 
>y> *Wl7n-*5V>tt7>^-7D-^**>£7 
*>ftiRU rcom-codej 7-f — JH«»4i:C!?*^ 

YV¥7,$ (mdp.Dor) ft^ViSCfcfeTf#S. 
'i<D*7'ky Uiyy \Ly>f/>? 7 > t* >yft8#trfe 



[0 2 0 7] ftfcifc^ftH 1 0 (Z)^W^7- Fff*fc 
2 8 0*4, ^^7>Hc0fcJ6ttI^t6ni)c * 

^7>HAtt«tC*g*7-H<!:l^-t&§, ^Otztb, 
#^7>FAtt3j-^7>FB t^lCrt^^ FB*<fc 

+ 0AEiB : FttA*7*-Vy h*E3£U RESIrf** 

[0 2 0 8] 05 3*4, :/l/>F*^0flM*7-F7'* 

h^-W4 7 o ttTSbtv^o ?»Fffl 
mz, &?**)vz:t<Dmi&m4 7 1 timu 7 2 1 
•css^ns, nmz. 054*4, p-ovrvvxa 

6, JW*7t^b4 7 7, SS4 7 8tJ;oTl3lS 

jWftStt^A ton, h 5 5 hjb 5 4 ©7 ^ 
-;wF4 7 6-4 7 8siwjrf£Bn?as. m^th 

1/X4 7 6*4^;M*U<E>BBi&7FI/A£, 

/Hh^ ^-f;WS4 7 8t47^7 p 'T^^<0^>f;i/S 

[0 2 0 9] 05 1 fc*V>T* fiS*^7S9iftatt« 
«4 6 6K:it>T««S-&6n«2:tt»*^. tttXtttt 

1. ft^0*^5>FC7-F + fc«*77£**v>ft 
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3. M **©*^5>KC7-KO 

4. St5V7h l ?l7*ffl'/iT. \<DtZ\Ziry. 2 
5. 

[0 2 10] «g*te&*tftL©0~2 5 5 ©I|:T$ 

Coa = i o a X A 2 5 

*;i< 

[0 2 11] 3. 17. 2 figmg*** 

02 t^2(C*3V^T. ±x-^/1XSB2 4 2 tf-^* 

x-r^7'W'ftcabfc7*-v-> h> *>6JB-oftaB 

7*-V-y h (0J*.«CYM£5WiCYMKWS<]l;:jI 

*^W»ffi2 3 5(lCBu s 2 3 lSr^UT. ^x-* 
/17.SB2 4 2. x-^*V^->a0J®Sl52 4 0. ATM 

>^7x-AX-f7f 2 5 2> e^-fe^^-^x-y^ 

4 6, MUVA->7r2 5 0. tf^7 > f**-#7M If 
B2 4 7, ^5>H*-^'f1fC2 4 8, 
#TM1f2 4 9£#tfi£U fi*i*E-HTrlMF*-*J:7 

X862 4 2Cltfi7-fe;i/7-f>rtlc2lta$n-5. ±f 30 
-^/tXfii52 4 2 (02) «A7J'f>^7x-XX'f >y 
?2 5 2*^tf^t:;^-^7-1'1f 2 4 6£5>LTfcr* 

-my>a2 3 0fcn-FSn5. if-^/W2 
4 2ttine.©«C^7>H^-^7-'rifB > a-ft 
LT77"feXU C!l^.«RGBfi?gM^P.CYM*SVi 

•fe;ir£tg**— #7M tf 2 4 9te£5. -fex-*/^ 40 
2 4 2. x-^*^^->ol2 3 0. x-*(WW«52 4 
0, <i!l©ffiae©x/Vrxte. iSHS-f&fflBB 2 3 5©fW$© 
t>iT. *-fflTJ-8t63ffl (SOGCS) *ift*-P 

»zwm&mji-m&&m (mogcs) *»*-h 

$2 4 0^x-***yv'a2 3 0©fi&ffflfcWLTtt, 
rx-^+^y^ilfel^i^Vy^aJ 2 40, 2 3 

b (102) ^yfe^Hlv|.^:ji- i h ,i ;v jtv: ; "' 

to2i2i m^mz^Mhzuxmm® ; 



7, RGB -fcJhS* <i C Y M Kfi^iS: ©«S©££J. '4 

-7MfflV^n5. mfLU2 4t*7 h©RGBfcf7±: 
C7> A73 feifi * f =3 ..a... <h, .'.7 z)l-Z:C 

YMKfiSri»8h-->l-±eg« (->7» lCV-ytf> 
y-f 5)1^:7 7. y 7r— 7>« 1 6 ^tf/W 

I^^tr, 2 4t£-y hRGBtf7-fe;P£CYM 
KfiaiB0 4 7©8t7 hifeH^lwV-y K>tr?Z>)\, 

fcg*!&*&fi&*>. IinKttbT. ±f-^/U2 4 2 
(02) «, r-Wtr>a2 3 0\zmht<tltt)ly 
7777r~7*;U£fflK A*fiffllB«fOjjCfcfiv»ffl* 

a. m-ffiTJ-iefe^ffl (SOGCS) 

m-^s^tttftai^fejcsit-H (sogcs) t 

(MOGCS) Mtfc, RG BfeSgfUM 8 t'-y K© 

S. RGB££^©&RGB*7ctel 5©ERi1l::#fiJ£ 
ft. j ett-m©KP3©ftSfi7 P V>*©RGB?*>'E>CY 

£. BPS, €»H»*»^V»#^^14*ij5-r«^CttKra 
*.tf-f>*$?xyh) ©«!l3£3n&!fta*»::£:3v>T, £ 

®H8c*i&fBt* a t^wft-r * z. t *>*jm-c*> *>. 
A*t;i'-fe;w©fi-ft^^^;pj:i:»c. essntoi 5© 

Kra*©ttB*^»&n«. if©KWCA7jftS^ffi^ 
^ft-r^^Sft^-rS&J&t, A73fig&ffi*t#SE-f-5 
KMrt©tfc8&ft)t-rSfc»4:© 2 ^Of-WfT 
.-^/XX«.2 4 2 ^*V»TJfiV»&n«i. 

7*;i/*fflV»Tfc6Vi. 
[0 2 13] Wj!fi©«k5irRGB©&*7tttl 5 ©Kffl 

ictfSdsns. w*. RGBftfi!BttBweK«6nfc 

3*7c9x>f xm&tti.'D'TlSQ* KM©W«©A*lf 

; ^ 7 i> ^yjHtiwfc^Htviir *t^tJx^«ti- ! 




( 47 ) ¥fffi¥l 1 - 8 5 9 6 9 
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T&ti&mmmtzctTXtbznz* zo^mzx sbpo, pi P7«m kba 

0, *eS^^U*fflV^ttn«&&ftV^SttSfi fiSIf rac_r ( frac_g, frac__b£ 

[0 2 14] 05 6fcL A*RGBfilfi7-fe;HC»L ICV (P0) ^bCV (P7) <Z>B*KBrt&«S£ 

Tv »JS-r-5Kffl^KrartOffitt*ft5£-r5«4 8 0 $: Sffi^TttB-rs. 

^UTV>£ 0 2 4 t*7 bA*Jt!^-ir;i/0 8 [0 2 17] iiBffl3«, £~f# (R) £fa<£> 1 &7C*i 

h*^ hA^fe^^^^^^t-Kr 0 l7 L -7^4 8 2^EPp1 B£fTV>* temp 11, temp 12, tempi 

rtffiKx- >OU4 8 3 Sffl^TUffSnS. ^5 6t"4o 3, t emp 1 4©f *HT^5*6S. 

Wt, 8t:y hA*ftS*4 8 lttl 0iiftO4*;W [0 2 18] t emp 1 1=CV (P0) + f r a c„ 

^U"JBS:^«^UfcfeOT*S*^ u0 8h^hAA 10 r (CV (PI) -CV (P0) ) 

fegH4 8 l^EP B 1^-y;U^KPflrt<ftfi-r-^;U^ templ2=CV (P2) + frac_r (CV (P 

Jl/7^77^lTfflt^n^o KPflf— ^4 8 2 3) -CV (P2) ) 

tt* A*ftfi*ll4 8 1 A*?¥ffi-r«> 0 1 4^TC0E t emp 1 3=CV (P4) + f r a c_r (CV (P 

woi^&4bfyh"caj*"r-6, KBrtf-^ 5) -cv (P4) ) 

;U4 8 2tt, X*fifflS*4 8 l##fcT5KBrtT© t emp 1 4=CV (P6) + f r a c__r (CV (P 

KBrtf-^Hi, 0*52 5 5*t?©« 7) -CV (P6) ) 
H©8t:y HB***ftbTfcO, dO«[J42 5 6©»» iWIH KT^^^T t emp2 1, t 

tLTffiMZtlZ. JEot, 1 0 31Sc4ft;t>f^-U-Tf emp2 2^, ft (G) 1 ^TcttBOfW* 

*6tobfcA*ft«l»*4 8 1 <7)^(-«, *T£. 

;i/4 8 2 6;i/ty^7y^-rs^fc"C, ffl*«0A^ricS 20 [0219] t emp 2 1 = t emp 1 1 + f r a c_ 

n£o *&, A*fi4*Knrtffi«f-- ^4 8 3T?^ g (t emp 1 2 - t emp 1 1) 

y&TyT-tZZ-tlZ&t)* #8cl 6 0/2 5 6 £^fc> t emp22=t emp 1 3 + f r ac_g (temp 

TttlAI16 0*«t^n5. KBf-^H8 2tK 14- temp 13) 

ttl^toTStfeSns. [0 2 2 0] f i n a 1 = t emp 2 1 + f r a g_b 

[0 2 15] _hi£(£>SD, &RGBft»*fc**UTKB ( t emp 2 2 - t emp 2 1) 

(H2) tcn-K$n> i^swe±r-^/u2 4 «H©«»&D©fiftse«iufciRira*b<fev^'f* 

) ;b7r 2 5 QomA^as 7tzj&t. MUV;s?7r RWl/TfcD, A*teHtt*ffl*«H«l-l*7C^yK 

250 (H5 7) a* *n*n3»«ftft»*fc#iS'r*3 >i^r*«^iwsanTv>*. A**fc#"r* 

r?©|R«4 8 8, 4 8 9, 4 9 0fc»»6tl«. U*lt««jiSt 5 1 0 t 5 1 1 fWfcSoTV** feat* 

(«M.tt4 8 8) tt, Hfc4lfy h©KBf— ^4:8 So «*©Ul*«#j«5 1 2ir^5>^an«t-r* 

h©KBrttt«x-^tC:»»6ft«. 1 2tfy b* £5 1 U4r©**a©W*T?*»ntfft6*V». 

MU*4 9 2tt^T r -^X*2 4 2CJ:t>TMUV/X «oT, 5 1 0 4:5 1 1 ©2 0©j»S»B-r*»£fc 
^7r 2 5 03*6ftA*fi9 L ^*^crtfc»0IB*ti *5 1 5*qfB«ta:D* A*£ 5 1 6 fcttffl** 

*. 1 0JUfc4©*-A*ft»*©±»«l , xrtt* 12K 40 5 1 7*WtSo b^U ttHOfMSOTWttttV^i: 

y hHWlttO 0 0 0 0 1 0 1 0 0 OOtft^ *KHWlttW*5 1 8fcft*J:5fc»£-fcri, 

[0216] 05 sat, *ra«La©«ftiRUfcBrc* ^rufeaattfivy t:>^T**tttiR63Stv^ 5 

*. ttB&9tt. 1^©3^7C^B5 0 0 (W^LttRG 1 0 t 5 1 8 t©»B«tt, A*J«5 1 6 K#bTffl* 

B&^fffl) frSffioeSB («j*-«CMY»«VittCM tt5 19fetJtr«, 2 ^©ttWjtt 5 1 7 t 

YK) ^©ttBtf^fcttST**. K£-fe;i>P0*>&P 5 1 9 ©Hgfct, WfclKH©«i*fcO©fi*W»f 

7ttRGBA^fi^wrtT?a<#ftUT^o, w*fi^ ^2i/Si/«ii:o<»f»ta*, ^©n«ta» 
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out= 2 (x- 6 4) 6 4^x^ 1 9 1 <Dt£ 

2 5 5 1 9 2^x<Dt£ 

BI5 8ir*3V^T, MPflfflStt, RGBtfi7-t;i/«r#-tti 

-K-cfc, RGBtf^n^T^T^tt^figiiici^ 

te. -si o o^tf^ir;u^n^na*jcfe^aefi^ 

Hia©8:3<Dffi (P0-P7) **f<£oltS^t*^ 

[0222] 115 7 iz^xmrnbtzMKi , 
sBtoar. ±x-^axsb2 4 2&#, a, 



94 



10 



[0 2 2 3] H60I1 i£- 1 2 t: y h 7 F U-X 5 2 0 

* 7P-Btfe«. 1 2 t; r 7 K I X 5 2 0 £-f 
fe£g[$188 1 (BHD oio&f-J + tyyi 

(BO, Bl B7) \ZtttZ>%m<b§\iv ' 

7va (H2) tt, 8fl<m£i©^im>;7 5 2 2i: 

^SJ^n, ^n^n^8ii^^-r>//w h7 Ki/xfi 

tt£ 1 2 h> h7 HI/X 5 2 0^6 8 7<>/;H h7 

[0 2 2 4] SOGCSt-FttWT^X^ 

[0 2 2 5] 

«12] 



£?lCiji-©l 2f-; h7Kl-X (I R, I G, IB) 

a 12: SOGCS^-HCDTFUX^ 





Bit [8:6] 


Bit [5S] 


Bit [2rf>] 


Bank 7 


Rp;l] 


Gf3:ll 


Br3:l] 


Bank 6 


R[3;l] 


G|3:ll 


B[3:l]+B[01 


Bank5 


R[3:l] 


Gr3:ll+Gr01 


B[3:l] 


Bank 4 


Rf3:l] 


G[3:il+Gf0] 


Bf3;n+BfO] 


Ban!: 3 


R[3:1>R(0] 


Gr3:11 


Bf3;ll 


Bank 2 


Rr3:l]+Rf01 


Gp:l] 


Br3:ll+Br01 


Bankl 


RT3:n+Rrai 


Gr3:ll+Gf01 


Br3:ll 


BankO 


Rf3:ll + R(0] 


Grarll+GfO] 


Br3:ll+Bf01 



[0 2 2 6] BIT [8:6], BIT [5 : 

3] , BIT [2 : 0] tt, *-ft-£n9 My hn>?7 
FUX©6^£> 8 t'y K 3fr£5b'yK 0d^2f 
yhfcjST. tt. R [3: 1] , G [3: 1] , B 

[3 : 1] ttl 2 k'y hTFl'* 5 2 0O4b> hKIS 
IR, I G, I B©Hl*>e.&3 bfy h*T*^T. 

[0 2 2 7] 81 2©.**y/1>£5K:|ILT» 1 2 If 

2tf5/h7Kk7.5 2 0*O4tyh^KfflI r©l~ 
3 tfy h*t9k'y h7( 4 PXB5©6~8 ty hfcVy 

tf>ysn, 4t>hi&Emi g©i~3t*s/h#;&og 

SftT9b*y h7KWXB5©3~5ty MCTyKV 
f$n, 4£?MtBmi b©l~3fc*y h*<9£y h 
7HV-XB5©0~2b*-> MCTy fc*>:?;*tt5. 
[0 2 2 8] 8 "3©?-f >/JW h7F 2 1 
5 1 2 X 8 f >y h^e,^-5^-r-5^ : 6U A*>^ 5 2 2 

2 3 'J A* 5 2 2fr£7y^$*U>. 

c©7Ht-»^«aa»cj;nd Sg;£P0~P7l;:*ffc 



^T©A>£T0O0 0 0 0 0 0 0 <hH3ll-©A* 
30 >i77\ t U7,tm^n^ifi, 1 2ty bTFl^O 0 0 

o o o o o o o o i ©«£-!;:«> a*>*7, 5, 3, 

1T«A>^7HPX0 0 0 0 0 0 0 0 0 t&tK A 
>£6, 4> 2. 0 -CttA>^ 7H l^^O 0 0 0 0 0 

o o i £ts.z>&5\zmuz>n>>77 fi^#*#5*i 
*. ^©.fc^icbT, A2>tr*-fe;MB*moB&8o© 
*-ffl*fi«tcv (po) ~cv (P7) *tn*c*;t 

; e>JA>^A^»e»n, ^t'JA^^JC^V^Tffi^fefil 

[0 2 2 93 B6 114. ^-fi^*- Kfc43V>TfflH 
40 5n«f-^*ty>a2 3 0©^ ; EUA>^©«J5KS 
&pt=EUA>^ttl 2 8 5^ >X>hUA> 
*>f£D, 67-f>X>HJ»43 2K-yHT4X8 fcfy 
M^'J 5 3 3~5 3 6fr&Mj&3tU5. ^ ; E'J7Hk 
2 l©±7k> M4, ^tUTFl/X+Ottfc-r* 
T-*?lJ£fci£U ^ ; EUA>^ffl^tUT7^?5 4 
2"TSfcJ6fcfflV^ns. T2k*yM4A'Th7Fl'X 
T, 7*h9-lV9** 4 3^©A*i:feD, if©4x8 




( 

95 

*^^>K*-jSf^1f2 4 8 (B2) fr&l 2fc!yK 

2 4 7, 2 4 8Ktti*"f*. 

[0 2 3 0] H6 OII^^T, ±x-^/tX^2 4 2 
(0 2) WM1^3Xf7^fTt^ 

*R^5 5 0) VSMfctZt^Vrtyt (0J;Uf5 2 
2) A^Hl**nSfifii:*ErafifiS*5 5 1S:A^ 
£U mlE^^miXx^7 P tC^^T4OC0m^ffl^ 

ffl-g-ts. sufy^a* (w^5 5 3, 55 

4) iii2Xf7^5 5 6tCi££ft> f rac^gAA 
5 5 7 *fflViT»2^f-y^ , Ofi9iCHtEoTffl* 5 5 8 
feH-J^TS. Sftfc, ^2Xf7^ 5 5 8, 5 5 9 
tfra c_b A*> 5 6 2 ££fflV>T, MSXS^T 

[0 2 3 1] ^6 OltSbfcJliH, ^*tf»*©X;P 

fc, H6 0©*fttt*-ffl*fi»*5 6 33i«i&»&tS 
fcfflv^ns. B6 0©¥iSlt *-rffl*H« 

i-, ^n^n^m^fe^?*AAx^^^ c t5^4Ax^ 
b. ftftma-ittfcgra^-F 

nyu-fe^U-2 2 4»4M0GCS ; e-HT©ftfP , bff5 
MOGCS^-HliVK^OjflCftRV^TSOCG 
St-PfcK^aUOkfFr*. MOGCSt-KTT 
HU 02Oix-^/XXlSS2 4 2, x-^+^yvafW 

4o©ifiB**HRffcffl*"r CKDfcfcfcttx-* 
+tr>^2 3 0©*KX#4te&Bfc&£*«. !2fgM 

M©i/4o*t#ifrr*. ffl*fiffiwoaoow*fi 

[0 2 3 2] &6 2tt, n^D-fcyU-aJWk^lr*^* 
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^Snftl 2t> h&5 7 0 »4Wifi©*l 2KfiEoT* 

(fi*lx_«5 7 2) ttH6 3«C*ViT£tTTiftWr5«t5 
t-ST^fcf>y*n, mT^^^'J/0^ 5 7 3 &JU 
y$7y?LXA'D<n&&t)3 L ^*)l5 7 4**f#*>n 
5. /^U/O^ 5 7 3 i*T5 1 2 X 3 2 f y h 
x>HJ<hft9#£^\ W^0 1 2 8 X 3 2 t:yh 
X>h'J£&$T£o ^U/O^ 5 7 3fij-^4'-t 
y->j.2 3 0©-!S£&U H6 3"CSiWrs < i:'5^* 

[0 2 3 3] H6 3tt, 9 t:y h/X>^A^ 5 7 8**5 

7 9Kf?Vy tfV^n^T ^r^LT^D, tyh5 

8 0-5 8 2£DH#^Atl#x.^>Ci:(Ci:D^ ; E:UA^ 

i»«r«t!^^;MB*«i3ii;+vy'>afi*c!)x>f y 

[0 2 3 4] »*rt3ft&* ; fcy7Hl<'X5 7 9H 

n^n^3 2tfy k©i 2 8x>hu*«&*s»*-r-& 

^y/Ojr («A«5 8 5) ^©7HI/Xi:l/TfflV^ 
£tt3 a 7\£y hy-OTFUX&m^Tt^V 5 8 5 
tC7^-feX*T^^<hT, ^^U^>^c:i:tC5^^5 8 
6Sn§BJ*^#5tl-5. ^^'J/O^ («*.«5 8 
5) a, f 2 H 7 h © 1 2 8 X > h U fi£ ^ 

ma^^tutw-r-s. 7 h^^Ki^Mu 

CiO^^^U 5 8 7$<DM&'tZ>?tf\Z7i7'teX'rZ> 
fcfcfcfefll^&tt*. 7KW 5 7 9C!)i*2b>h* 
5 8 7+©#*T*^^i:tt(frr-6 2:tTr, 

j£3tl£. 210 9lfy h7HUX + ©«*2litybtt* 

M) o ^T, MOGCS€-FTIt RGBAAM 
IB^#t«k^C*V»T»»J:<4*Kfc»«ISft, 9 
fc> YT FWX©«*2 h**RGBA*fiKffl + <0 
*E£»jrr*z:fcfc&*. EP^, 2a©hf?h*yfc 

nan*. &*?^>0&^tt£#£'r* 

* u /v->©z-r y 7x 3 £ 

[0 2 3 5] 2o©tfyh^^HftbftV^Ktt, 

b» * * y k# m u * y a 7 ? yj&s 1 1 

*^y>3.;^^7^Xffla»4, 2tf^h^^x> 
h y cwis-r*?^ ^o-r^rott^w^^y 
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(2 4 2) \Z 

7s f- v y 5 9 

[0 2 3 6] 
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e&stu ±a©araji&si (H6 2) m 
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0 2^t 



l*A 5 9 5 t!5. 

7yj7<>7i^fTSS©^ 116 2 



[0 2 3 7] U-fU^yViflfio 2-D<D&'£m&&=£- 

b (010) T?0^tJ^{bttEtTO*jfi*#T§, 

[0 2 3 8] feSPa^SilCiS^Sft^ff^ti 
[0 2 3 9] 
[SI 2 A] 





urn 






Operand A 








Operand B 








Operand C 






I&F 


Result 


■* 










IVW b 





[0 2 4 0] H64H tt£lflM*7 ^ft 

%v^±-*7a-m^ikimT<D£?\zteZ> 0 [Si 3] 





mm 


teans[3:0] 




M 





[0 24 3] H6 5tt* MOGCS^-PIC^V^T* R 
G B t!^ir;UX h U -AtCYMKfi«t3li»t*¥S 
*«LTV^4&. Xfy^SlfciOT, 24h>hRG 
BK^1r*X>U-A^K^1r;V*-^--f If 2 4 6 

(02) fcAASft*. Xf7yS2TH 05 6 £0 

5 mmwLrc£o\z, tr^-fe^*— *f^-rtf 2 4 e*« 

;i/^i7T^7 p 7 L -^SrffiViT«-A^B*<2 4 
IB«t8lfyhKHl'itt«teftfer-B. A^tf^ir^ 



0 (02) lCS§o Xr*y7 r S4T?tt, *12tH62 

0d«21©l 2Ky h7h*WX7-K£8^tf>9 t:y h7 
40 FWXfcS&WS, £n.&©8r?©:TFl'XBU 80© 
tiiAiCV (P0) -CV (P7) ©^U/O^S 7 
3 (06 2) *©ffi«£3R-T. 7f^yS5Tlt x~ 
m^>a*(M2 4 0 (02) *«8^©9tyh7 
Fl^XS. 06 3T«WU&J:5fcSTyi;>^«. 



( 51 ) 
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y h?W9 My h7\ i UX<DWi±2 My hfc-RLfc 

nn\t. m*fefficv (po) -cv (P7) 

ya^^'J 2 3 0 C:fiFfiEb&V>. fcTXf^S 7 
*V>T, 2 My h??X.>hV\Ztt&?ZthJ3&mifi9V 

2 fciy h^y^9 try FU7><Dm±2 Myht-Wc 

S8l:fe^TB6 2(^-t5ViTiftWbfc5li!gT8OC0ttSA 
felCV (PO) -CV (P7) £&D£tit\ 

(cut, A^tf^-b^&aoffltrso^m^fificv 

(P0) -CV (P7) *^x-^/tXBB2 4 2lC«t^ 
Tr-^tr>a2 3 O^SftO&ins. Xf-y^ 

TffltffiffiCV (PO) -CV (P7) ^x-^/^X 

[0 2 4 5] RGBfiara^jsrsai*fiffi 

2 7ny£M;:#«fS*&fctt, f^tirr>aO» 
*W*ttlH Aft*® 1/32^7^7 «>*T&v>. 
[0 2 4 6] £fc, MOGCSt-KTffl^Sn^f- 

[0 2 4 7] 3. 17. 3 JPEG»tft/«f 

«!p©l:3fl«J PEG«pT?*5**> JPEGWPfcll 
tSSfflftMVan Nostrand Rein 
ho 1 dfciO 1 9 9 3?l:|il)S$nfcPenneba 
kertMi t c he 1 1 KJ;*9*J&:* TJ PEG : 
Still Image Data Compress 
ion Standard] &#BSStt&V>. 3^Pt 
yD-2 2 4 14 J PEG@ip<E>1t:/-try HfcJBWClMfc* 

»j«r^o jpEG«*©flj»tt, ■*««»iyfc** 

IBUT»<ft'r«fc»fctttt©*lp*«V^T'bAV^ J P 
EG**tt»n*tttA<lD6tlfc«qSTr*D, AS I 

*©M*ij PEG378ftftS6ttr«aN»6# 

tO 2 4 8] dyD-fe^*2fe4ti, 1, 3, 
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<D3&^m (BPS, t!^-tr;Ux-^o^C7)RGB) 

te, 3yai!^2 2 4li3fift*Wft/h3/H h 

10 [ 0 2 4 9 ] J P E G«WttiB«*«/]A«F-tffc;fflffi (M 

CU) tl*«n§/hSsS:2*7cSBfi:tC^f»i*r§. 

1 6 tf i7-fe;k «8 K^-feJKOMCUTffefiVib, ^ 
>^>yj >y*tlTVi&tiB«0*'&©«8 tf^ir 
;k *£8 tf^irJKDMCUTfcSV^ 

[0 2 5 0] B6 613* 3»*Hft*^#>lJ->:/U> 

[0 2 5 1] 0 Ott, &\Zt7±)V 

20 tt6 0 1*tYUVfiSmt©Y, U, VJMW>£j£5If 
m»itMUVA*;772 5 0 (H2) fclMAStt 
TV^5. ^©f-^«Sf40©f-^yn^^7 6 0 1 

y£tt«*COfiB*££2K 7*D^ 6 0 1, 6 0 2H 

a8-y->y^sn&Yji*'r*D, 7n>y^ 6 0 3, 6 

££X\ nyDir^-9-2 2 4fi21SW^ 

[0 2 5 2] fei-^OJ PEG*yf>^J>yit 
06 7tC^Ufc4fi5 1 ^*;^^>^U>^T?*So 
21©*^>yy>yT!t4, 1 6X8t!i7ir;l/6 1 0<O 
K^-feJHr-^^nyjr«fa#OY. U, VB^fciOA. 
T*S9!£Bi!S (0) ££&4£*6 1 l£#LTV> 

cotr^ir;Px-^6 1 ot>H6 6tra«fc*^ 
40 [0 2 5 3] l**u £o#&fc»au 

ffiViTx-^^n^^6 1 2, 6 13A«$tl§c 

[0254] B6 8tt, m2<DJ PEG#*tft;g§2 4 1 
£J;D§£fflKtft$Lfc0T*£c JPEG«FWfi^ 
S241H J PEG«F^ftt«#t©3R**ff5. «F 

2 46 (02) *6^Dyi?f-^*afltS. ^Dy 

M&fcxx*/ yi±&*te*t*. fcn&^x#l, 
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2. Dcrmtjomj-ite 2 2 

3. *Nt«6 2 2T*fr3ttS^if^X**>l;:J; 
5DCT^»G>Efi 

4. 0Hfttt#{fcgfi 2 3 TMTcflo DC DC . :!A 

o^m^mttA c dc t«s©7 >v>i? znmk 

\aMt^)V^-fU{7^6 2 5 iRbu s 6 2 6 ^iMT 
^**-*^1f6 2 9 (0 2) KigSftSo 
[0 2 5 5] JPEG«§i8ill J PEGfJ^fbftff 10 

u s 6 2 Oj&^ffiHISftfc J PEGyn^^^n 
fflgS'&tf. JEffix-^teB u s 6 3 0 ^LTA7V 
>«F^ft«6 2 4(C3l5n, r-^*DCS»tAC7 
>W>i7XttC^^$n^o 'Alt, x-^tt«*«F#fb 

S6 2 3ic2i^n, ActDciW§^n, »*© 

D cWftfc^r&MTftffl***^* £ t tD C^& 
oa«T{fc;*«fTbn5. S^tc, DCTAB6 2 lfci5V> 

ft, Bus 6 3 1 &frLT*?)V3 L 7U>7V 6 2 5, B u 
s 6 2 6 *^bT«**-^f^-f iftCSietl^. J P E 
G«F#ft;S§2 4 KS, J PEG^fb8©ftf^*BB*S$ 

«^*Stifft^o ■rHsse 2 2 t/\y 

-^JPx-^tt, Obus-f^7i-X«6 3 4** 
bT7^irX3ft£ 0 CCTObu s< >^7x-XgB 30 
6 3 4 It**? > r^-tfrM "if B 2 4 7 fcftlttStl, 
5 s -^*lry>'3L«»*2 4 0 iffUBU&3. 
tO 2 5 6] DCT»6 2 1liem7-^l:#l/T 

»S*rt*D, rs t i 1 1 Image Data 
Compression Standardj (f^±) 
O+ICbESftTV^fc©©, DCT6 2 ltt£tT©3B 

r«aDCTgBj T»»r**a#s««viTv^. 

ft*, DCT*i*»fffc*5V>TI4, The Trans 40 
actions of the IEICE, vol. 
E7 1, no. 11, November 1 9 8 8(7)1 
0 9 5^-$?fcJ8«attfcCA r a i ^>\Z^^sX TA 
Fast DCT-SQ Scheme for I 
mages J \Z&~3< DCT^ftfSMV^C <h*>T 

tO 2 5 7] fcHs»6 2 2«DCT«»<Z)ft^{b<kS£ 
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[0 2 5 8] H6 9«, mmTite 2 2^^Dmm\zm 

mLtcmT$>&. S^{b§&6 2 2&, rj-#;i//\*x£?r 
2 1 Cx-^^llf;^ DC 
T^v^-^e 2 1 7^6x~^£:§TOo7cOf oUC 
K>j7x-X6 4 0 Si^. ftTfb^BOK&^T 

fiT{b§§6 2 2 ^^D-^M^ •. r - IZ 2~j^l.- 

776 4 1, 6 4 2©10tC#*iL*n5. Zb7r6 
4 1, 6 4 2HA*T-^^77rt§^^2^© 

DCTD-^ya-JM 2 ld^CDftftr-^tt/t-V^ 
76 4 L 6 4 2<Dl-2\zmt&tlZ. rty?7ifi7)V 

&th2tl, Ob u s-f >^7x-XSB6 3 4*^UTS 
»lftifftttfeoT*SS6 4 3tll^§o £ 
©WAttfll»«F#ft< >?7x-X 6 4 5 S^LTSft 

«F^ft:«6 2 3 (H6 8) KCgstts. £ft£><D$ag 

mt&iftT^Z. J PEGa^j&St*5^T, Mf(b 

nfcD c T#*s*ff -rs - tT»*Ffcia3£fT 5 . 

fflV^tl^. &43, a^MTib^-y^co-Ox^^X 

[0 2 5 9] STftfflatl^flJC 2WV;77 6 4 
1, 6 4 2^S«Ff«6 2 3 (H6 8) &*><DA 

fHStt, DCTO?7x-X6 4 0^lTDCTf 
yWa-*6 2 1 (06 8) (Cj@ie.nSo foT, & 

DCT-f >^7x-X ; t^a-;i/6 4 0 
tCC<>^7x-X ; E^a-;i/6 4 5 tfe/Vy7 7* 
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fc£-T6<, SSS6 4 3I1 1 6 X 8 ©2 

[0 2 6 0] B6 8I^*3^T, ffijftff *Hb« 6 2 3 

(a) J PEGqE-KltiiWSDCffiS^flJ^fb 

(b) J PEG ; E-KlC*5W§ACffijR©7>U>^ 

«S«F*ft;»6 2 3tt, K>^^y^fc«SSnTV^ 
J;3K, DC/ACff»© : F«/5>l/>^Xtf^ft;/ 
a^tfr5. J PEG«Pfc»j£;5nTV>*J:<5 

jsjpeg Acm^oy>u>^7,nmt/mmzm 

[0 2 6 1] A7V>«F-tfb»6 2 4tt, JPEGj- 
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A7V>X h 'J— £**P bus< >^7x-X 62.0^ 

SR^ffSfiFfft^ya-^S 2 3Cg^n§ 0 A77 
>«F#{bS§6 2 4«t, x-^ + vy^ifcfciKtett, o 
bu s-f >^7x-X6 3 4£;frbT7^irX£n£A 

io 7-?>j—zf)),*:mmtz>. A7?>f-yj^ 

[0 2 6 2] A7V>^{blCi5^Tx-^*^y->j. 

ir x - * t— :/;i/£telfrt 3 o 

[0 2 6 3] x-^ + ^y^ilCfelBStlTV^A^T 

[0 2 6 4] 
Wl 4] 







0 


/fc/?Ol* BHUFCO.DCJ fc^lM* EHUFCO 256 OflD 16 fcf 
0<7J 16 t*9hflB££K*-&, COD^E'J/O^OD^rO!) 128 ^-f^tf 


1 


1* EHUFCO.DC2 7~})UO 258 ®<D 16 £?k^HJ£1HA 








2 


it>02f* EHUFCOAC.2 256 16 t?7KC/MJ£tttt 
**.-f>1hf*X^Wf?Mt 32 fcTyh«3-F*(D20d!> 16 fcf^HS 


3 


/fc/£3l* EHUFCOAC2 256 &<D 16 e?HX>K>&tt® 
<>^$x<D&/J*e?H&< 02 e?h9-K*<D2O<0 16 tfvHg 


4 


/t>^4fA EHUFSI.DCJ &Sl*t* EHUFS3 -r— -?JU0 256 @<D4£vh 
x>MJ^BOiFSIJ)CJ? ?JUD 256 S(D4£?KC/W£±&|Z(§ 

S&tfco c<o^u/^ot-<r<z> 128 


5 


/O^Sti EHUFSI.ACJ x— ? JHD 256 H<D4tf ?hx:/MJ * . 

ehufs^c.2x-^;ko 256 ®mt!?hi>W^fcfcJt&8^§ tt CO) 
J^ij/Oiwr ^TID 1285<>#jm*6tflS<> 


6 




7 


ttWttS3rHST-lM) 126 S<D 24 evK^Mj£*&EW$ 0 ^ 
y/t^©*KT© 128 vW>«Ha^3/WK6i5A* 0 



[0 2 6 5) H7 0K6HT, A7V>$J*|fl:«6 2 4 
«. tap^ft386 6 Ottg#g 6 6 1 £V>2~D<Z>®$.<DZf 
ny?frP,±\Z.mf$.2tlZ>. WSOZfUyZS 6 0, 6 
6 1 ttV^yi^+ht^i-^e 6 2 t^bTNCO 

bus* >^x-xi^;;WE fe*i . i 




it? >r 4 

a. fiMWfc 

■jfefed^ ft.tri— t«2Tfcfei Bt>^dn*> ^Jbir 
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- 7Mmftm& M \z 9 * v 7 v A> S ^ A □ - H 

oh* 1 ; h .-&n T A->i?/H K^^£^£fi£„ A' 

X' 0 RSTm7-^«M 
££ttv-#jWfA3ft5**, CtfXRIctt, It^;\7 

T>#^f(£> r 1 j fcVy 

*lfcA*< h#X* FFCi^/ci^X' 0 0/WKJf 
Affl g**fr frft 5 . RSTm7-^ j&t&B if 5 * 
CM7-;y:; - JH^oT^^ " :. , Li C 

^^a-i^t Pbus-CCXl/-y-f>^7i" 
X±<Z> rg^oj m^TCDjf^i-J:^ pSi*©R»K:E 
O I A~f EO I AiMlC^ 

a, #Affla**#»fc&*. awjAhu-AMt 

[0 2 6 6] ItJPEG^-FtOBSIClt CCij"^ 
va-Jl^ (Pbus-CCX^-^>^7i-X) * 

5. (J PEG*-I*i:|i?«fc) ^7'>a 

fu ig*^--^<if 2 4 9^21 ens. &*5, tb^x 

b. «*t 

«*t7;k*UXA»4, iCS (U7;kMA) e>fc<D<h& 
ii^fe CO K»T;^UXAttJ PEG*- 

F©#T»fl5U fi*7;P^UXA»4J PEG*- FT 
t)#JPE Gt- KTt>»ff-r«. 
[0 2 6 7] «ajPEGA77>a«7;W«JXA 
HU /\77>^>*;^DCfMWV^AC7>l/ 

«0/\7T>f-^ <K3, K4, K5, K6) 

14, *iryVa^ : EUft#JR'r*Clt&<«#T*«J: 
5 fc, T)VJV XA*lCA- Fttfc«»&*nTV>*. 
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[0 2 7 3] 



fct>0T*5. A>F (RSTmV-*TK«e»tlfcy 

h«©««*t;p-^ 

AAA HJ-A®*8J©8 If y h£ I NDEXt 
HUFVAL (INDEX) t7x7f 
I f HUFVAL (INDEX) == 0 0 x x 0 6 0 1 1 1 . 



14, l^D-/^*<^;i/TH«lOC0DC/ACffiftT 

HC^y*i?a.-;i < hcc-y-y*vjL-;i/p B iT 

Sfc»(C, 19Uy9*M9)l'i&&\ZtSiZ>Z.th$>Z> 

[0 2 6 8] A7V>ffi^SttiS5ifi*-HT»ifPl/, £ 
n ^ £ -y-^f $ )V Z t iz 1 a 7 v > ~> > tK;!/ * ttffi-f 3 . 

1:2 6 9] £ /v. . ; t-7°(c 

s.tJ<eaa^7;urfuxA*«^T*o, ®7iic^ 

tig® 6 7 0 t&oW, 
. [0 2 7 0] J PEGff^fbAHJ-AfcttbT, Ah 
UyA-6 7 lfc#WTX' 0 0jfA/WK X' FF 

ffi<£>lfy httfefcA7V>$/>aR;^>7^-6 7 2 
ICig^n^o A7V><0*©fJ^bA HJ—AT 

tt^cofflattfrfoftfcv*. 

20 [0271] /\y-?>is>#ji<mmDmw<DZTyy 

tt, A7T>x-^AhU-A©aaj0 8fcTy hT7K 
l/v>^$nfc+^^5>^lC^^nfcHUFVAL7 L 
-7>£>2 5 6C0X>HJ£;i^^7y7°t~£^T& 
C©ffi*We1""§A7V>5>>/R;i/(DSE^fiaT* 

6 7 2ic7>r-FA^2n, w«f sttimify h*ffl 

A7*-7-^-6 7 6fclE2iU &*fS$6 7 3 fcKfc 
A7V>AhU-AOj&rb^MJS8Cffi^Sa f 0'r§ o 
30 T, bW4«#«©HftT*«. IlO^hy? 

7y7#«*«fcfcSfc*>ofc«^ HP%A7V>-» 
*^8HyhSUT»ot»frl:tt, h-T^FI/A 

a*«-ff**u — grf££T> *5i*»4 r^®^jA7v> 

*y->Artfcfl» 7^HrA*^fT$tx^ a )Vv97y 
ififi-$C? « £ ±E £ n«o««*«ff toft, r^ifliA 

[0 2 7 2] fc-^fcat^<«7JWJ.XAttttTO 
DT*§, 
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(H I T) 

n n n bfy h£e e e e eeee ltffi<hLT$Ei£ 
5/>j$)V&N = <i e c i ma 1 (n n n) 

/ * 0 0 0 *< v y # )V ft 8 £ b T * / 

break 

else /* HUFVAL (INDEX) ==01 i i i iiii ii 
i i (MISS) 

HEAP I NDEX = = i i iiii iiii It-tey h (h-y^-Xfc 
0 \zl5M) 

N ~ 9 It-fey h 

I f XtiTs h U -AM 9 h*y hft 0Ti§ 
HEAP INDEX£l±iJD 
f i 

VALUE = HEAP (HEAP INDEX) ®7i^f (»9 fcfy KDff 



Loop 

If VALUE = = 0 0 0 1 0 0 0 0 llll--(NL) 
exit 

elseif VALU E ===== 1 0 0 0 eeee eeee 
eeee eeee £fil<hl/T&ii§ 

Art* 
break 

else /* VALUE==0 Iiii iiii iiii 
(MISS) 

N = N+llCt*;h (HEAP I NDEX= 1 i iiii i 

i i i) 

I f A#XMJ-A0JBNlfy h&O 
HEAP I NDEX£ imM 
f i 

V ALUE=HE AP (HEAP INDEX) <D7x.y?- 
pool 
pool 

XHJ?/t67 1HU Kti J PEG6 7 lfSMtftAHJ EGt-HWW) »3^>*;UK:l«<«lPlf ^ hftt 
-Ajfc&X' OOjfAA-fK X' FF§£2^A>fK ftSflU 6 7 2 fc*»t*5fcO^>*;P©BI»ftl» 

*«M»anfcftipify btfcfcfcs^e 7 2\zfem [0275] 9mm s 1 3\th-y\z&~3<T)iJvx 

T£o ;\yv>Ofr<Dftnmz&^T\m<DttlMVy h 40 A<2n7£^£LT£0, Obus 6 7 4&fiTx-* 

[0 2 7 4] ->7*6 7 2?Dy^ttl 6 tfy MB*l/ i/>tf)l&*V>?-, t-^W >f; i7Xjn^g|5, # 

*©A7V>-»*;Wft«#«5 6 7 3 It JOtTy h&£>«^lfB <**tttt#*fcS^V>TfTfeft 
(MSBfr&LSBOJHSOKy hXhU-^T) 1ES «) ft**.*. 7x7f7Fl/XHKW5 

Eftfzn<D\iyh*mtt?%frZ&tt%<b\m%®6 ; to 2 7 6] 7x>^7^X ; ! 
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1 ?*-;i>p (ify » 




! |32:7J1 


>£S^fc(d- quantization)^- 7)^0)% 5 1 


■ 124:191 


Not used. 


|18:91 




|8:0| 





[0 2 7 8] tti^7^-V^^-yp^i7 6 7 6 8 tf 
yhffi^fi^ (X^>F70 >/\77> ; £" H) , 2 
4t*7 hmtim\iy htR$Tm-?-tJ$imt<D3 2 10 
■. y ]• " : ■ ■ j P^G^-F' : "J; c 

\iv htt* «^SS6 7 3^v>^^^^^#i0b'7 

#7*-Vy* 6 7 6l;ite2£3tl£. *fc, a^7*- 

Hii5i^ffiV^2x>r-7 P F I FO/1y7 7*«*.TV> 

Bl*7*-77^6 7 6KSi6©l«^9-F (F 
IFO(Cifc^LTV^) £ fg&j <h LTj^tfj 

hU-AJCjRUTV^^HtC*jg©7-H*BJI» 

[0 2 7 9] 0 7 0l:fett-5A77>»tftg6 6 0 CD 
7yyf-^6*l/T/W hf-^^A77>y>Jp 
*7*— Vy^6 8 3* **yv^£7£irA£tt£ 

[0 2 8 0] XXm 8 BttT-***? ^afcttttS 

ft*. J PEGfflBK*V^T»4, AC^JS^DC^ifi:^ 



7^ > T^l/B* ^ «h K»J <9x - W&B £ & 3. # J 
PEGEHTTtt* 2^c7)^-7> (fJ^IMX) 4)^ 

X' ■■'('.. C o f.j' r j '~ : S. ~y ■' > o '"j ^ . - "1? . -A ci <f'L 

~>7^ 6 8 2 i&BWpO/K h/^ >f >^ilT$§ 
RSTm£EOI V-AJfAffiHfcfefr?. flX, x 
h\mtiy^~^y^6 8 3^teit$n, X' 

I:*4off;h HTOBaffla, /^*>^3tifcA 

tO 2 8 1] X' FF/H h^Jf A^S^v7^-6 8 
2\Z,k-DTfTt>tl%fc$>, HlA7*-77^ 6 8 3a 
X' FF/H hSrlflltJf ATSfcfe^, y7? 6 8 2^ 

Ctlte, /H H^JSbWS^^I/^* fe->7^- 
6 8 2rt^{^X.^clch^i;^Tfi : ^n^o /H 

6 8 2\Z£-DT*#m'}ZtlZ>. t$£«atf 6 8 3teV 

-Ai:Mox" ff m ;x>f hK*fctt#A»afitTto 

tO 2 8 2] A7V>«F*fts«t4 J PEG£E«tiViT 

4»5W48:3©5 L — :/;U£J8V>* 2 0<Dx-^U ; £K 

[0 2 8 3] A7V>##{bgl;:M>TfflV>£ft3^- 

[0 2 8 4] 
»16] 
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I m 




m 


EHUFSI 


256 


Huffinan 3 — K+M' X'. itf£A7 "7 >®- 
< >x y ^X t LTfem Sti*. 


EHUFCO 

i 


256 


it* A 7 T >ft*WbTttffl * *l* A 7 

t > □ - km. $ n^ffltt< > 


EHUFSI_DC_1 


16 


JPEG JESS"? DC»»*«H*ftt"S©t 


EHUFCO_DC_l 


16 


JPEG JSESgTr DC %WL nXCT SOI' 
fi£ffl$n^A7 > T>3-K<i. *£3 

ffl. -y-7*-y->7°;^nfc7'P7^(c ! b 


EHUFS1_DC_2 


16 


JPEG JSEI8T DC 0Mft fc«F*Ht*$ COIC 
ffiffl$nSA7-r>n-KiJ--rXo ± 

ttffl. it7*if>7 0 ;u$nfc7'o^^fc 


EHUFCO_DC_2 


16 


JPEG BEStT DC £ $f#fc-f 5 CO \Z 

-y-7*u->7°;^nfe7*D«>^n 


EHUFSI_AC_1 


256 


jpeg jefj&t ac m.*nmct*o\z 

&ffl$n*A7-?>n-FiMX. * 


EHUFCOAC1 


256 


JPEG fEBsT AC wStSiT^ltTo COtC: 
^Sn*A7T>3-F^X. * 
C9#x=f U t 9 > 1/>^X 


EHUFSI_AC_2 


256 


•tf-7*1*->7Vl/£ft5ji&#© JPEG 
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bH U r CO_AL_2 

i 1 


256 


vsv^yJl' 3 <17w07tW JPEG GEwn? 
U <h 7 > 1/ > yfv 0 X t L 



[0 2 8 5] 3. 17. 4 -r-^H- >f^{7z/yif 
a2 3 0lt &^-r>#*8 9-b + ^6J5t££ 1 2 8^-T> 

T~-7MZ?!?±7s?%. x- ^Jl/tt/h^V^ K2 5 6 
iga) &<£>T, Obus©32t*;h7KI/77^-JI/ 

Sl7:/\77>/lH:- 



20 



£3, 

[0 2 8 6] ±a!OJ:-5tC, J P E Gfiigffi^fc- FT 

[0 2 8 7] A7T> y 
X 

[0 2 8 8] 
[*1 7] 



0—3 



1024 & 16 evKOk-^B^fi^&fi/h^vr^xe^Hc^u 



4 


oc /A7^>«^f-^/Ko 12 e?h«aa>5%*+8if 512 mm 
-f> j fv^*<Da*2e^H=*y, a2 tr*h7-K*®4o© 

'<-f MM *^fc/*a>*v<T<D 1 28 7^jW8^S>*i5. 


5 


AC /\7T>ffi^r-^WS) 12 evh#B<Wfcfi']*8fcf5rKg 512 
#t** <>x?£X*©E*2lf?hl=J:y* 32 fc*9F>"7— 

HRBft»» > & «u/t/^ot^ra>i287-r>A<rat^6*tfi« 


6 


DCS&tffc AC /\^>SS#^^/l^)5t^4evht«}^So ^f> 
^xflWR^ztftrhfciy, ft9--F*a)4o©^^H£iBR-r* 0 


7 


B¥ftT-?*0 24 eVHKHt 128 Jt^'J/feMH-'* 
-C<K> 12B5-f><0a^3/WH&i5^€o 



[0 2 8 9] J PEG«F#fls882 4 1 (02) fc*V>T 40 
* (PO_IDR) **VM4 (RO_IDR) fcjg«& 
fete, ^©ftSttttar^#AA7*-**BftfcH* 



lMXtfJlfe«&*Kfe9&fe*ftT&D. #KJPE 

G«F#ft;/«^©«k 3 icffla^-^tM' X*i*»T?*-5 

fc*K#»-c**. cn?.©«8ii«, m*T-^*#ir 

m\t*y\Zt£-DT&Q> RO_CFGl/^X^*©®«!) 

u j p EG#^-r?ttc©tf y t^-ty ^-r-sfeste^ 
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[0291] H 7 3im^x. wm. mmmzmmt 

A^B^6 9 OH 6 9 1 t&Z>m6 9 2 

**WiW*£FftT5. U^b&A^, JPEGfSMtfbv 
XfATH8 x 8 e^ir^D^^^MIiin, -t© 
H«OS^8C0f&»i:)&:e^^a^, MCU6 

0±5fcffl9l;:.fc?K «»ffl*A>f h 6 9 8KB©x- 

& 18: t&<fr"7-K:-7-f*- 



1 - 8 5 9 6 9 
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[0 2 9 2] J PEGtt#SK*ttSBIB, fWHi«te#» 

«FtC*5aT*S^r-^ibT2OC0t|^ftS. mi CO 

7° 7 0 10-ffifi7 0 0 *ttia*S^tt»ft-rS«'&T! t 

$7 1 4K:feV>T, *»©^aXh'Jy^(8lliH 

7 11, 7 12, 7 13) ^}M!i&-&^«^^ 

[0 2 9 3] JPEGft^**7*-77h*7>f- 

[0 2 9 4] **«7-H--7-f±- ^yn-H7^-^' 

[0 2 9 5] 
W18] 





KM 


D 


0= flMHb <EH) 




1= flWfc (fMt) 


M 






1= v;i^ir*^ 


4 






1= 49-^^)1 


S 


0= ^/7yy-^>yU>^^^U^^ 





1 - n *^*>^U >^IIS5S££fflTS 


H 




C 




T 


1= ttiii&mvfexz 


F 


1= 1f^th>^'J>yOf)^ffie7^i^&ffl(f^ 



[0296] 3. 17. 5 x-^&^fcfSK? 
a-/a±y^2 2 41i@2©J PEGflF*t{fcg§©-S8£ 

LI*. A7V>fiftftttJPEG0^5fte 40 

tO 2 9 7] 3. 17. 6 iS&DCT&g 
fel5©S7 7fc^tfci;5feSt^3-y-'f (DC 
,+) .|fcire& its x ft :rtiy*©*b!rffltedLT i 



■fe;^D^^©2^7c^^*llffr-5. £©i:5fcg@ 
Ttt, A*®8 1 0 9 6, JHfHtt 1104, ffflHHK 

1098, e&^unissi o 9 0, ihaik&i 092 

[0 2 9 8] A#laJSSl 0 9 6H8X 8 ?> £ & 8 
h K^-feJNMMW*. Art B& 1 0 9 6 HU *M 
W^WllOO, 1 1 0 2 «M,TJWIfE«§ 1 
1 0 4 fc8M*3ttTH*. gflflsIES 1 1 0 4tt, 8X8 

5. fMPESS 1 0 9 8 tt, MO^TOiaRftflfffb. D 

jt*y i ti § d, ,^^i;b9 5i : ^^16 9 2 
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£-!M 1 0 2Kffl*£j£fflT£. ifc, vj^yi/^it 
1 1 0 2MWJ7s$ 1 0 9 4^£<£>x-^£ ! fog{rf 
K^IhISS 1 0 9 0 ^8 X8^D^7»^Wi 

TAl'U fr^Tr-^Sitb^l"^:- H. J r;j[il8S 1 0 9 

[0 2 9 9] iS#CODCTgHTtt, f£#rf^ 110 4 
[0 3 0 0] H7 7©»»IhIK1 1 0 4H, -|RtC*« 10 
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#ai/T»a£fT3. a^r, ftflia&Rii 144, i 

1 4 8, 1 1 5 2, 1 1 5 6*JO*8^*»*fei:©S 
d©«J:53&#tlflHl»l 1 0 4TH *-cd*«IhI8& 

awi 1 o 4oa^©ft8a»*3Wr'r*fc»fc«vi 

t>iiviia»ai»£t±tr36:ttti«fe6&v^fc», 

[0301] H7 8lt i77 COgRKiS W-£iI#©£ 

1144, 1148, 1152, 1156 O^ft 

<19)VZt^ l*7CDCT(D45[tagPil 1 4 4, 1 
148, 1152, 1156 ©tft^ftjWftfMcSft 

*fc, c^bbic^t^, 4*aaa»ii4 4, 

1148, 1152, 1156 ©-ttl^n^**©** 

[0 3 0 2] U*bfc*«&* &OTtt®XAIiafttfft 

atCftoTVia:V^tT*<5, 4fflaSK 1 1 4 4, 1 

14 8, 1152, 1 1 5 6 tt*n*n*«amw^3iw 

'OWTH^DyJ 1 1 4 4©2 0 n s) . A^^tB^V 
^TVi^l 14 6£1 1 5 4<Z>jie (fn?tl2n 

s) £, 7yy^7D^yil5 OCDjiM (3 ns) * 
£*T<h, £ft©Sl3S**2 7n s ifc*. «£t>T, ££>D 
CTifi!ctllil2 7ns "ClWFr*. 

[0 3 0 3] /Wy5<>»*©DCT«riGt>i:<ft6 



20 



30 



40 




[0 3 0 4] H7 9H, tfi7-fe^x-^^A*IiIKl 1 
2 6 K A* 3 ft, %}iyY\Z>7±)VT-*om*&iht 
Sct^&J PEGjff^fbffi (B2) K&UTffl^Sft^ 

fty^Utt, 2*7C««:3U-<>SSI©2laIBO^A* 

A^B&l 1 2 6 tfiJft^U 1 1 I 8^<7X*^ 
7Jk?-7V.:' ; r ]. 1 2 i cftv- .. . ,v ;-7'V ; 
>2/£ft, ffl^x-^A^JSllZS 1 1 2 2fCi£<oft£ 0 
SE*@Bll2 2©g*H 2@a^;^(7)«l7ftttS^ 

^ftgB+ox-^^fiEft&ftna-rso 
[0305] hhrhim >*SMaaossiiHii©/^T 

taa*Sft-6**Bi»«**«, A;>JB&1 1 2 6Kj£ 
£ft£„ £<E>/tXTte, v^^l/^U-i 1 2 4ttM» 
1 1 1 6KJ;oTR£3ft, A A SKI 1 2 6*>S 
1 1 2 2l:x-^^bn^ 0 
[0 3 0 6] H8 0(4, ff*lHl»l 1 2 2©*jfi*±0 

37*7- KB&l 1 3 8^^*^VJ^yix^1t 1 1 

2 4fct5i>Ti8«3ft5. ^£*e> i^-y- 1 i 

2 4ttftflflil&l 1 1 6KJ;oTR£;*ft5. £Hf?&n 
>jC*<0*ff©*^^tt, «»BK1 1 2 6C91££ 

tcs^v^x, mmmi 1 4 o^coa^v;^:/^ 

ij-l 1 4 2lC43V>T«&Sft*>. IBgW^XTte, & 
hJWWUmiti 112 2 (Mff BK 116 6 icJ: 
r>T»»fcKfcSft*) ic^oTjaaaftfca, 
^ t U 1118 lcSiiift§o 8X8^D 

t'Jll 1 8tcSaSft§^:, IB*3U--f >*lft<D2Iil 
B©/^**HJfi*ft*. 
[0 3 0 7] 7*7-H*«V^ttaMI«[3*>f 
2IeIBO/XA7?tt, ffWSCO^^hJWBft^tU 1 1 
1 8*&M*tB3tU V^yi/^U-l 12 4^LT 
JMKBRl 1 2 2fca6ft6, ^<D/tAT«, 

1 2 4«A^IUK1 1 3 6frt,0 J f-?«:tSk& 
b, Hft^tU 1 1 1 8*6©fr^ Wl^-^*JMIf 
OKI 12 2fc«a-T«J:5K, *»H*fcJ:oTBBg 
$ft§ 0 DWEIR 112 2 tCDV^y 1/ jr* 114 2 

MmKi i4o*6cD»*x-^*»isiaBi 12 

2©aWjfc»*. ^«0B1 1 2 2*6©»«3W6ft 
fe^/ST, MffBBl 1 1 6*6©»*fcaE^TW* 

t o 3 o 8 ] flcff SB i i 2 2 tt % km&zfenktz 
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SfiJB&l 1 2 2<Z)ffl**^OlB*^ hJU*ttl*l5lB' 
1 1 2 oic»oatrt»*^*iE«trjts-rsi:tA<T!* 

5 ( gflfESn 1 2 2lc*V^T+B!fEte*^-fc&V^* 

ftm&#<ZtifiT$&£th\Z* WWm&l 1 2 2$: 

x- * 5 <7) izwt z> mm* ft Mist <& 

[0 3 0 9] B81H ^#©SMAWtw 4 a^MJIg: 
1158, 1160, 1162, 1 1 6 4©fiK 20n 
s+10ns+12ns+15ns=57nst^0, 

ms i©BKK:isv>T, 

£#3 © It 4 £ n y # -tK ^ t -T £ £ , 

^*ODCT->XxAfc*ViT»/h*ffKfWtt5 7/4 
ns (14. 25ns) H7 8T»4DCT^n 20 

y Wft7)Ufi2 7ns £*;Sf«*»fcV>;ii:*jB*5 

[0310] *d cTmm<Dmmo^fT&f\~&^T\$, 

Yukihiro Arai, Takeshi Agu 
i, Mas ayuk i Nak a J ima^Ci^Th 
eTransact ions of the IEIC 
E, vol, E71, no. 1 1, 1 9 8 8^11^© 

^-i? 1095 izmmzntz&x m9k<oitibonMD 

CT-SQ#%J TiRSft&DCT7;MyXA*fflVi 
frTSCfcT, *DCT8WO»*Hlll 1 2 2 \Z& 

sizmwr&zttfim*. mmz, ttoDCTrjw 

UXASWIfHttl 1 2 2*fc7V-F t }x7<hbT62® 
[0 3 11] 3. 17. 7 A77>8§g 
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hS^]$^7V>tf^b$nT^Sc^V-A^^^ 

[0 3 13] tt^tftt, x-^«^l/^^^5A*S 
tt*»©W*tt, (v) iA*tff^<D£IR0SS 

fttfy h7-r-;n*<DSS (a) tawr*. ^ct, 

a«0^±COfi|-C*^ e ?I*Slf^ h7>f-;i/HHA7 
V>tf^ft*nTV^Vifc», t<l:/\7V>«Ffft3 

n*. a#«oA*+©fiSn©if^ h7>r-;n«tt* 

Kfi (m) *ft£U Wt^h (a) cDg2££fcl;: 
ffl»:/ny^fcte2i-f£. fHflJ:/ciy£tt$/7hffi (a 
+ m) *ft5£U 7-?/V-Ay7hyD7?*ei 
lT^g^^;H:IATAAf-^^y7 h-T 

[0 3 14] *««©8«Ttt, A*«F**©* 
B£<Z>S£* A7^>ffr^fc;£ttTV>ftV>H? h7-f — ^ 

M, RAM, PLA&4!©V^a4lft^-e'iaBS©«^« 

[0315] xmmm-vtt* JPEGg?t 
i^xtttiiMrr*. j PEGwreft&sti 

CtAC«»Oia*»^«fstoH5rh*3Rt'. 5*-* 
1/ ¥7, $ * Of- ^ * 6 S ftft A 7 V >^^fb 
StlTV^VMiy h 7 -f-JH 4 fcbTfi, J 
P E GillptC jgjfc $ tlT ± v \Z h t <D AJ] x - 9 7 
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[0 3i6] mmcDmm&mt, *»w©»a&*t 

{.{■\1jy~u yp&iMrl'L . [ii^yn^^i, JPEGl:^ 
nt^&^h'^ h7^-JUK^, J PEGlCi3frtS*7- 

*coj:-5i-S^j$nfcx*/ws $ nx ^ & v> 

[0 3 17] J P E GfJ^fbSS 2 4 1 (H2) Kcfc-^T 10 

wntiy hj ti*«n«»pjajfi«F#ft;anTv^^if 

#J rjf AAW hj rg§ii;i< H 

- ^ h *k > * y - y £ ftfc 7 v >#*tfb n - 
To 

[0 3 18] J PEG«F#flS«2 4 10/\77>S^ 20 

H8 3I4, J PEGr-^©A77>«if©I*6P 

ffllrSUTV^o XhU7/U 17 1 ttV-#«F*| 
^FFXXhex, XXIi*f) *I$£U ;W h (ft 
§FFhex) *i¥AU /H h (IffFfhexCS 
<ff§00hex) £SS&fr. CtlbW^TA^r- 
*©/K hSEMSftfcK*T»0* 3 2bfyh«7-Ht 

*tfyhtt, A*t*y hAhU-A©3teSK&£<, Ah 

[0 3 19] A*T-?^bU^Al 1 7 lfc^Dy 
^*>f^;Hri02:t©3 2E? h»7-F£LTA;*j£ 
n^o AWH 2 1 l£O^S3^<E**ttttf£0 

ssic^-ro (i) o/x>r hawA/w k «&a 

#^ (i -1) *60O»D©;Mh^hU?rAl 1 
7 lCfflArfefcS^hStU (i) fclMSfo 40 

^/n 1 7 ia>£ffl#£nfc/w h©w$Mt«, ®8 5 

fc3SSftT^5grJ©ffl*^y 1 2 1 2fc«toT«F#ftS 
tl£. A h U v A 1 1 7 1 fc£oTft*3tt& Wt-f h 

a-t H4, *ttW*/vr (a h U ^/^ i 1 7 i 

a\ to (a K u ?n i i 7 i rfeisn 
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tfct, V-*>/7^6jibTV-*U5;A^ 1183 
^v-*ffcB©n-H*fH»'rs. A*7-F*6l 

A^h<hffl*^y©0U2 1 3*^^ntV^ 0 



[ 0 3 2 0] 



^ 7*D y £ 1 1 7 2 , 1 1 7 3, 1 1 8 0, 1 J 8 1 CO 




1/ A * ic^SW (i =f - ? & □ - K -T -5 d <h 

7^#AHJyAl 1 7 1*&©x-**«HI?"£©1;: 
*fU 7-*y7^^^^il, V-#ftg£ 

3„ #Ahv7^ 118 0, 1181 ©ffiA14, 08 3 
iC^^tlTV^^^tCMj^-r^l/vA^ 1 1 8 2, 11 
8 3KiS»«3l3ttS. 

[0 3 2 1] H8 6l:>bf-^yi/y7* 1 1 7 2*^ 
£ttTl>5?&*, x-^7 f 'Iyv7^ 1 1 7 2(1, A h l J y 
/U17 l^&0f^l:3 2 «©-£□£&*£*;/ h 1 
2 5 1fc#iaU 6 4fcfy hlCx-^Srffia-TS. 

*t3Sx-^»4 6 4t> MSOAl/Jk>7? 12 5 2 
T^&fcx-^l^A* 1 1 8 2K£ffi#ffiT5fc:y h& 
«lt->7hSn*. £©&, KyUfctt* 18 

2, *7-# 1183 i/5?A^ftfc2n«i*©«n»ey h 
)W*«Er**t«tjiaibTv^«fPD^y^ 1185 

*bT, n\/)\sV*J7s9 \ 2 5 2»4, 
6 4tyht, 6 4®<D2 X lg^VJW^l^lM 2 
5 4^6«*v;U5 t 7 r ^iJ-^ny^ 1 2 5 3iCfi2|-r 
*• «*2XlVJ^mi 2 5 4tt, Al/;^> 
7^1 2 5 2*6© 1 t?-y h fc^-^ l/$?A^ 118 2 

^>c -*7j, ^T*S*^tCtt, AWUv7^ 12 5 2 

©hfy hfcttJ^rrs- ■T^T©S*v;^7 p ki7^i 2 

5 4^©*HI«W4, 08 6fc6tflCB8 7fc*tf«k 
^A^ 1 2 2 3©^l/3/7^«fieyb0. . . 5tL 
TSStlTV^±'5fclS(«l^ny^©->7 hfelfflllffi^ 
iDtt-^^n^o S+VJ^^^if 1 2 5 4©tH*l4 
AWl/v7^ 1 2 5 StcSl^n, H8 6fc**n«J:5 
fcSliy h«fMB#V7HWll2J:04*&ti*Hy h 
&^fclt^7h$tiS„ c*i6©lfyHl, x-^l/v 

a^ i i 8 2k^UTax-^©«#fcJ:oT<Kfflan 

.'*fciyHfcftStfc4©T*b* 5ga^A7V>ri-K 




J ' 1 . ... >• 
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\Vr-9WJ7»$ 1 1 8 2*<Dg®\iy hfeWMISStt 
Sty Hft£tT"e*tl«0*SUfct)Ot?45. dOJ; 
o\ZbX, ni/Jlz/y? 1 2 5 5^*>tti*;*tt55r-* 
Ktt, & JltDmZr-? 1182 

-*l^X* 1 1 8 2©rt*H, Mh^y h**«^3n 
SfcftKl^X**^^ HT^hSft, XMJy/U 

1 7 ] (K 8, 16, 2 4, 3 2 C7^ ? 7-^!/ 
vX* 1 1 8 2(r#ft]$n^ ( h^-3^:^t'^S$n 

So f-^l/y^? 1 1 8 2\m^X£&fi\-f<D+ftte 10 

<DT-*tfi&tE?tittmD''( 9 Mz& ^X n - H * n 

So i^^jn^^T^h'j^/n 17 l^eof- 

[0 3 2 2] 7* 117 3, #Xh>-7* 

118 1, v-#l^X^ 1183H r-?yi/y7 
^117 2> f-^#Xh->7^118 0, t-^I/^ 20 

x?ii82 t-tn-enH-©«ffi"c»§. «tt 1 1 7 

3, 118 1, 1 1 8 3fa<D^-$ryn-tet>Zf\Z^n 
60»ffiffl©f-^7D-fe, gBftl 1 7 2, 118 

tt»«##*fW« 1 1 8 5iOjR*0«fity McSS 

nso ^neoftttoijia, v-atv^^i 1 7 

3 tT-^7 r l/^7^ 1 1 7 2©AAr-^19Jt, V 
—%V 9 J7s9 118 3 tr-j^I/^^ 1 1 8 2©rt§ 
3j*£fl!>J:5fcMv^ti«*\ ii^jfln?**. 08 81;: 
^C$&o\Z, XhU^/U 1 7 9 \ 2 6 1 tt 30 

8tfy h7-FiUTA*S*U 

2 fcfl*5T-^httl:2 If 7 HlDSTSnT 

telT&£o XhUy/11 1 7 l*6WWffcSI6ft«4 
1 7 3tf>A*l 2 6 2£LT2lffl$n§o Cl^J:5lrL 

k, cneax-^i/^^ 1 1 8 2 *-cv-*3jt«fc 

l/Tt^o V-^l^X^ 1 1 8 3+©V-*tt«ty 
h^f-^kyX^ 1 1 8 2 HORIHW 

ttS*»VMtJ:oT, BS^Dyj 1 1 8 swt/tr-r > 



tV-* 118 3) H[M-©*tJ^ff^^#X.^nT^S 

[0 3 2 3] «*§ffl5l 1 8 4 (H8 9lZh^nx\^ 
S) te, f-^l/yX^ 1 1 8 2<D&±1 6 tiy h&A 

a#snfcA7v>ii, a^snss&oA^tj 

Artft#fcH<ttiDfcfy Kg («^ffi©IMfcfc& 

5) Sttffl^SfcfeClia^fc'ti-iHlKlg^fflSl 18 4i- 

chlC 1 ^^fia^m^ans^^r^^s^^ii^, /"\ 
7T>ffl©a^Slft*^bii:@gsyny^Tfftoatfn 
tf&SfcK «^««, 0 8 9\Z^T£.o\Z, 16tfy 
h h-#>*x-*U>?X^ 1 1 8 2^£A77U A7 

*?xi (8t£yh) , »jK-r§A7v>^ft3nfcx 

£ J: 5 ftlft^to-frlalBy a y £ t ITA- F 7< *S 
ttfc 4 P L AX ;i/©a-^T— ^;i/*«?LS £ £ 

[0 3 2 4] /^x>f>yfc> FOBlHSJBJite, Hfllffil 

1 8 5©-«T*S/^>yfc> h0«ft«C*3V>T 
x-^l/i^X^ 1 1 8 2tWf-f >yty h^W&ffi 

8 3, 1 2 4 2©8B;fcfcfy h + tCV-*ffi«ti5/ h# 

ftLfc^Kte, V-#lxi?X^ 1 2 4 2 *£>v-# t: 
y M^&^tf y MC^TSx-^kvX^ 1 1 8 

2 , 12 4 1 $<D^X<D \iy YtmtEOnf^ >^/M 

y>f\Ly h&m® 1243 ic^ott^t 1 

HJIftA8H, 1 1 8 2, 1 2 4 1 (RHP 

fcV-* 1 1 8 3, 1 2 4 2) 

S^>7^1 1 7 2, 1 1 7 3, 1 1 8 0, 1 1 8 1 *ffi 
v^T 1 9 n y>WJWSi:y7 h £ iiTfrto 

HWJ-f ^ ;UTf4?5: < a^«^-f i7 jp*t*ff a ns . 

fcfftotu f-^l/y7? 118 2 l:/^^f >ye^ h 

[0 3 2 5] 08 7tt, MBtfl 18 5*»ailCSbfc 
fe<DT*S« ««*©4»fcflWHttl/^X^ 1 2 2 3T^ 
b, x-^P^X^ 1 1 8 2*0^#^ftf^ 
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iSX? 1 2 2 3 fctettSnSx- 1182* 

117 2, ] 1 7 3, 118 0, 1 1 8 1, 1 1 3 6, 

118 7, fi-Sfffil 184, WttJ7t~~^y hgU 1 1 8 
8^<Dtt»«^©£j*T*£. ^HOflMEte, ±$<£>cfc 
^ilx-^l^vX^ 1 1 8 2+<7)/\ a -7V yVMy h<D& 

[0 3 2 6] x-^ V-J7^9 1 1 8 2 +©8? LV^t: y Y 
ft (n e w__no b) te, f-^l/yX^ 118 2 (n 

1 1 7 l^£>a-K^fg&fc:y hft (no s) (hCD^ff 
U MV'fi?MZ&^TT-i7l'VXi? 1 1 8 2*>6BJ 
BfrSttStfyhft (nor) S^bfcfcCDiLTffrJf 

[0 3 2 7] 

n e w n ob = nob + nos-nor 

£ft&©ft8ttJOJ¥Sl 2 2 1 £tt#«l 2 2 2 
fr^tlS. m&'f9MZ&\,*T7>hVyrtl 1 7 

l^x-^ArtanftV^^Kte (nos) ifiOt 
Sfc, x-^UvX^l 1 8 2^*ViTfc*-y hfl* 

8 5fr&©8ft#ft£«<tt»lf y hfit©fPKT7f* 

V^^l^fe (nos) I4 0<h&£„ fit (new__no 
b) tt6 4t«A*21t««*0» yD*;^1 2 2 4K* 

XHJy/U 1 7 lHffiHttitfeD, «fL 

V^x-^OD- Ktf&SftftK VJl^7V£"tf- 1 2 3 
314, XhU^/U 1 7 -lfr&D-FSnfctry Hfc* 

ifDicT^rc^icfflv^n^o .c;nr* xhu^n i 
7 i «<sfihs*45fln*ttH»snTv»av». «*fsts 1 2 

3 logoff* r/^-Oif+MZJVi \V?)\r?7\/!; 
*1 2 84t»MU /tx-f >ytfy h***W4t« 

K» (nor) t^TWRTZ. W%\*v h$t 
&T-*l'P7>*$*)\ZyY9L (nob) eUirfeS 
fc. fbttSl 2 2 8fc*viT«(!!fStl«t» 

2 3 4C4*.6n*^7hr<eW«tfy hRI4 
NANDY- M 2 3 0 fcJSVvtHfOKRJtaft*. f 
fcfe%, (nor) r4*UC«BfcS*u ?~9V*J7s9 
<D\Lv h©ffl*ttffbnftVi. VJl^l/^iM 2 3 4 
©ttlAtt, #Xhy7?l 182fcl 183 CDfelfflltCfc 
JHli&ns. f-^l^?7^ 1 1 8 2©fcttf y HD*; 
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[0328] a^-y--r ^;Kr*^TB'J»*nstf 7 hft 
cofhffttSDffS 1 2 2 6 lt*5^T^T^n^, ^ " > 
H(4SL^>-&i? J i-;.IlSS1g^ffll 1 8 4^bAA^no s . 

etyho^fia^aci^r 0000" 

fb;**l£fca&, " o u_r e d u c e" 

strw" 0000" 10000" iz^mt^n, 
timmi 2 2 KDititittf* d^iyt-^y 

10 1 1 8 6 t 1 1 8 7'\<D*J«ff 

u^m 1 7 iit*viTBij»$n§*^ xhu^/ti 1 
7 i7?Biji»$n§£t«ftE0 1 v-^i-5feioffiMtc# 

(4#ftT£« ttSSSS 1 2 2 9 Tt4, l^X* 1 2 2 3 K 
IMftStlTV^^-^l/^X^ 1 1 8 2+Olfy hffcA« 

20 (x-^UvX* 1 1 8 2*<a^$n^x-^«B©»t) 

eo i v-*©f9©/^ >y««iM'X*sT£<fci;: 

[0 3 3 0] ni/Jk>7^118 6, 1 187^:^7 

mi 1 8 8 ttt*#-hns»6WL» % 

30 KHPfl|^l4±Jfi©J:5e«fl«l 1 8 5 £D^x.<£>n 
#JOt^ hyi^v^^ 1 1 8 %njr-9V*?7s9fr 
63 2H5;h*A*U S*E«**tlTV^A7V>» 

^118 6 <Dttij]\Z&t)1t:T&\Z&WrZ> Ct\Zfcr), 
rtV)V*sy$ 1 1 8 7^<D\jlthTMt>n%>. (tJQb* 
y h#Xhy7^1 1 8 711, f-^OHlA7*-Vy 

7*;M19 6fc*^Tbfy h 8 4*6 If y h 1 8 fclta 
Sti* *IBc!)ftloifyh»tj6CTflk*lfyh©^<T3. 

*^4«sa!fc*«2:i'b**. cicDify hftttJ peg«s 

«PTftftSftTV^J:'5Kl 1 9 6©tfy h04^63lC 

t-ftffl^*«^fc»4, 7t-77M:^CT^ 
UJ^>7^ 1 1 8 6, 1 1 8 7 t-tcDHteftjtE-r^c 
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1 8 5*>S4A5tt5DC/AC«» (1 1 9 6, fciy 
h0*>5 7) £DC«ft*fSfcTy h (119 6, tyh 
1 9) , fltflfciy hv7^ 1 1 8 7A^#K.Sn 

£ftftit:y h (119 6, b 8^13) > 
l/y^^ 1 1 8 3*f»4i5nS7-*ffiie^ h (1 
19 6, t*7h2 3) <h£[£i9 1 \Z7tt7*-*?v V\Z 

1 8 8u, m^e^a^ >97x.-?,[zm-?z>m 

[0332] 3. 17. 8 Hfl«eftfr<fr 

[0 3 3 3] H9 2tt, V-XB*©»ffl&««**«*f 



7^*7 2 iT#i^n^o y A\z, t&<DW*mm7 2 2 

tm<DV 7^ > 7 C 'J > if9BM £ ^ o fc 2 :3 ©fflSA iffi* 



^yir^&ffiU ^USJf 7 2 3 A^ST&^i:^ 

J: on fs^^saaftfp*^.^ £ n * e a w t? ^ ^ 

r B ^nfc^^>7 p ^^lraS:*r^jn ; g-e*^. »#sn 
fcK^-fc;iHittS*af*a:*feTra»7 2 7 3*u @W 

[0 3 3 4] H»*«*^©fc»Oft^7-l««F#*H 

[0 3 3 5] ^7"K : T^t-t^- K7^ 
[0 3 3 6] 

ai9] 



7-r-;uK 


m 


s 




aff[3;0] 




P 




C 





0=ffl*M<DltttttO*lt4L 



[0 3 3 7] F^m^-f-JPFOBWifc 40 [0 3 3 9] 

ETFfcjSf. [g 2 0 ] 

[0 3 3 8] > FtmgV- h* 
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&9I 




JHf7*-Vyh 1 


Operand A 








Operand B 








Operand C 








Result 









•J— a. M'^a 



[0 3 4 0] ^5>f«AH. £IR©g&£5fc£-f5& 

o^ts-s (Aga*e?©Lt*-> h-oMten*) . 094 

5ttgV^^7*-v«/ h*^-T. ^-^JPBBifi^H, 20 

1. V-XB«H»ME*7 3 0 («F*fcL©H«ft. 
2 4. 2 4SM) . ft® (0. 0) *tHfc©£±. 

2. 3 1 tS*7 3 2 (-y-y-y->7» xJU^ 
(2CDMK. Hitft, 2 4. 2 4MME) 

3. «aoHJfeftffJ8ff*+OA'f^U*'f >h©tt 
Sfc^-f 3 My h©bp7-f-;H* 7 3 3 

4. (»«Er*«^Ktt) «MfJI«ft7 3 5. Jin 

7" •;&©&§■?& 5 2 OC/HtWC TpJ^j #-f> 30 
(2©*t&) TT**. 

5. A-^ffiifi^+WBOoy-Hlft^-rr 1 7 

7 3 6. £©tett^£f?&£*#tt& ■&>©*> 
&lft9lV>£;tf££«. , . , . 

to 3 4 i\i^hij-^)vm^^t\zA^tx 

p[n] = (l.offsetin] • mdp^ :0000) + J 

[0 3 4 6] rtfipwice, WMittft^+^crtoa 

l/CV>*. 3 6 h©«#<»tt«f#tt€f©2©iitSi: 
[0 3 4 7] 3. 17. 9 



[0 3 4 2] H9 6l:*V>T, ^5>h*B«V-XH 
-f -f >xy£7.T"--7>©8ljfittH9 

i&mti.V-7.W&M?-k}\/?)7>*\->y'{ > m%-\ti 

42) tjirrt^3«ijfiTr**. — aaic -r>x>>^x 

t— 7* ;U t 7 - 7. ® it tf 7 -fe ;v <h « * *y ~y a. pJ t£ T * 
[0 3 4 3] ^7>KC»4*¥/Sa-y-7*-9->7*;PU 

JMrWOJfcTCfcJ^TJMtett*. fr#I r £ c ©&tg 
MU M 9 7 fc^-f «t 5 CMMHAft*® 5*- * 7 - H fc 

[0 3 4 4] ^*t!^-fe;VP [N] ©^-fr^NttJKT 

[0 3 4 5] 
t»4] 

J w„.-s(x+rAx,y + <Ay)[n] 

e 

[0 3 4 8] V-XMWffiS (x, y) *#U nx 
m«a»*-*;pj5«i[C (x, y) S£C 
©fia*H tn] CnJUOft^Kl, 

[6349]. ... 
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H(x,y)[n] = (l.offsetn) • md Ppor :QOOO) + J J S(x + i,y + j)-C(i,j)[n] 

1T\ ie [0, c] , 



[0 3 5 0] T4x.£ft£o 
j e [0, r] 

[0 3 5 1] ;*7-fey hffiCMSi*. *BB*g*©#»ft, 
b p7>f-il/H coja*tt«»SE8lft* i: £. 
[0 3 5 2] B9 9tt, ftii*.*"*;U7 5 0**7-\X 

lift 7 5 ncsfflu &$mm7 5 2 £fe&-t&#j£^ 



turn* H*je»^^t^«^frfcn*. ^*^> 
Hfciift^aiit^^iCT^s. noon aii^ 

t^C9t^7- Kfr-^£^L/ct>C9T£)0, BIT©*** 

[0 3 5 3 ] fihfr7-F 
[0 3 5 4] 
[*2 1] 





an 


s 


o=4jafflv-xH«tr^tout»Lx3M^B*tft*i»a> 


c 




p 




A 




of(3:0] 


1 =^-7-b^Ku*?X5(0^^^;^<i523ffl&y 



[0355] 3. 17. 10 ftpmW 
[0 3 5 6] 



[0 3 5 7] ff^»»ft^*^7>Ht»*7* 

[0 3 5 8] ft^*^7>Ht«*7-h* 
[0 3 5 9] 
»2 2] 



:6] 










' b o,o 





















30 



"0,1 ^0,2 ^0,3 "0,4 

*U 6 U *W fe M 

^ ^2,2 *W ^4 

\l ^2 fr *3 *M 





mm 






Operand A 


v-xbm&ib* 






Operand B 


urn 98k 


torn 




Operand C 








Result 


MM 







[0 3 6 1] 
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1 

[*2 3] 




mm 


C 


1 =aj±"<£hA>7>*-7P- £ 0x00, OxFF 


P 






A 

1 .. 





[0 3 6 2] 3. 17. 11 A-7h->ft 

*£U->##*UT*^>;i*£Wi7>;*y 

i^&Ay^tfyh (2 U^A-? b * 



20 



[0 3 6 3] ffl*A-7h->fflttJKT©iC*ffl^Tffl' 

[0 3 6 4] (PX (1 - 1) +d) /255 
eieiT, pliKJtJHI (0^p^2 5 5) , lttl/^ 
;i* (2£ 1^2 5 5) , d^x-flffT^Jffl (0^d^ 
2 5 4) :*^>F#^«&T©5I0T*5. 

[0 3 6 5] #^^^>H^$g^y-H 

[0 3 6 6] 

[«2 43 



a 24; Kf^^a-h-feJztfUtf^h^— H 











Operand A 




sis 






m 































Operand B 




h 




Operand C 








Result 


/\—y h->n 
— K 


h 





I0 3 6 7] *^7-H*WW4. 

to 3 6 8 j 3. 17. 12 ||rtMMHfc:7*-V3>f- 



7> 6 ©»Rfl S ftfcUKftln $ 0 ©ffitfA 7 V > 
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[0370] mmmm&7*-^y h«§^2xf7 

(CiliaH7^^-y->> r . i U^-5 B C©7f^^TH It! 

[0 3 7 1 ] SBMra©»^tJ4 2fS, 4&E%t><om 

©t^irJUtfcS. HI 0 4IC, 2^C0A^7T"^Xh 
U-A7 7 0, 1 1 \ *m^T2{%mm<DWi-h7,VV- 

A *. 7 2 '! fS-r". ;0 A h U - A 7 7 3 £fc£T£ 

S 25: 
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[0 3 7 2] S«»#fflStt* 2^r-^Xh'J-A 

FA) f4^-XfflXhU-^T*0, S-Xh'J-A 
(t^7>HB) ttSffiXMJ-ATS>£. 
[0 3 7 3] il0 5C, S«IB£ffl3*m>fc»£0 
2^OA*Xh'J-A7 8 0, 7 8 1 t*fJK-r^Hl*X 
h U — A 7 8 2 

[0 3 7 4] B10 6I1 B)iMli»7*-Vv FfiH* 

10 ^— ^^n-K^^-Jl/HcoBaHS^-r. 

[ 0 3 7 5 ] f9M*7- ^— / -KV^-Jl^ 

F 

[0 3 7 6] 
[*2 5] 





nn 1 


R 




V 




F 




C 


0= 7 >?Lfctr» 

1 = ttR^ffiS^ 7 underflow to 

0x00, overflow to OxFF 



[0 377] 3. 17. 13 tf-l*M* 
«. 

a. WBt 8 -***** 

cne. ©#<£■(*> a?m >^7x-7. ; E^i-;k a* 

1* >*7x-X;W y?-2 5 2^ tf^-t^-^T-fif 
2 4 6, J PEG«F#flS«52 4 1, ^^-^^^2 
4 9, H)W>?7x-X ; E^a-M5&53ynt 
i/D-2 2 4l*30MOf-3'7n-/USfflV»5. CCD 

[0 3 7 8] x-^ftfPijfP©«60^tUT«filT© 
• U-^n-f Hit (t*y K 2 fc* y hit, 4 fcfy hf0 © 
7-FF*n?©A*-f h©/1y + >^t7>/^y*>^ 



30 



40 



M©8«1 



^^U 2 3 6 tffimi >?yx.-Z2 3 7KT5 i -^e 

si (5R7jrfij) *<fffcn§. vine.©^ttsifT^<fi©^ 

^t^-nyy7*-rSBt-©#^-T*-5. g^nS© 

[0 3 7 9] ^yribf-irfTCK, ^7>FAB3 

tt. ^7>HB{C«toTAA^©x-^SfP»lfPA^ 

[0 3 8 0] 3. 17. 14 yu-fflWfir<fr 
y a-UWfir$V£. B 9 KSLfc J; 5 fe#^Hff tr* 

©s*^rs&a5tar*ii»-r5fc«e>©^»Ta&s. 7p 

£ It 1 "2©<Kg7 H 1^7.*^ £><B©7 H VX^<D®Wi$:»l 
m\z?%0ifttfgi?*>y°&z>u\t&&teLi?*>y°$: 
%ti. &fttt£?*>y'ti!*\*. 3 7"cHrytH>k>?x 




( 70 ) 

137 

[0 3 8 1] H 1 0 7 \Z, 7 U-mm^O^^ H 
?. £tT©attv>r^-*yn-K©KWT* [0 3 8 3] 

%>„ [£2 6] 

: '3^2] r>ir^7"l— -■7';' ^7^-;^ 

& 26: flrtfr9-K:-7-f+-*Ka-K7.f— juK 
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type 


UU = /T// 

01= 1 


C 






1= ftttftSft^DAfrfc-fe? h 


N 


0= *fr3WK(Z»fc^>^ 


1° 


1= »*Lfc**a«H7TS®*fto 



[0 3 8 4] $?*>7'4MN;:*SV>Ttt, ^7>FA7 
Hl^X^jt^L, *^7>KB7- h'Oii^l^^^rt 

gttttsrrsMtfc*. fr^^nc 7-1* 

[0 3 8 5] ( ( ( r e g i s t e r_v a 1 u e x 
or Operand B) andOperand 30 
C) =0 x 0 0 0 0 0 0 0 0) 
Si:, pa^utfy $.>in>'<)n!+#\zMW?z 
&*© v $?7. * 7 * -fex ©&* tr t v> e. n 

3. 18 V** £ k-**- r ? © J Ei>a-;i' 

3. 18. 1 \ZWV*-U1-<1*f 
K*-fc;W-*f:M -tf 2 4 6 teA#-r >*7x-X*-f 
y?2 5 2*>£©x-*XHJ-A©7r ; t'7.*J&56L 

772 50 (C»tt$nS. h U -Afc*ff *«* 

hU-A*±T-^AX2 4 2*5^ttJ PEGjf^ft 

$2 4 1 vzw-t-ui-iW®^- m 

M%<D C B u s -T 7 x-7. IC«t o TSt^f -5 £ t # 
T£S. tf*4r;U*-#:M1f2 4 6iaPO_CFGiW 



(a) 7< H^t-K : fcf*-fc;M--#:*"f if 2 4 6*< 

(b) -y-'ry^^iV^-V : ATJx-^lirtflBF I F 
Ofc*Mfi3fl*J:3e£?), KifW*- tf^-fif 2 4 

7i-XX-f-yf 2 5 2iCT-^^^*f ot-F. 

(c) fiaiuuejftt- h : K.z j e.)\,*-a-*<iitifi&& 

MUVAy7r2 5 oicte^nT^-s-f >*-/s;M3 

(d) J PEGffifire-F : tf£-feJl/*- ^-flf 2 4 
6 tfHIftx- * &M C U ©fSTM U VAy7y Ktelrt 

H. 

(e) S&94dE*J:IflM&a%- P : tf*ir;M— 
#±-1^2 4 6atfT^J&Sc£MUVA-;/7 7 2 5 0 »C& 

*rtt, (fcg-ci&ftwax-^/ix 2 4. 2 cfc^nse*. 
[0 3 8 6] *r*"rif2 4 ett^x-^/'? 

X242<fcJPE GflMHbS§2 4 1 ©W^t fc©fjfe© 
fcje>fcMUVA^7r 2 5 0£<£5. -6^ra^lC*V» 
T> 4 >9-/V\,&£Ttft&T-7)\>\ZUX)V RAM 
2 5 0ICtt>T»ifft$tl, 3 6Ky>Of-^ (4^)© 
*?-9 t +*;W x (4t!y h©>f >^-n;Hti8h* 
yh©#»<i) tbT7^i:XSns. H&ggM3J;tf 
ftii*SI^©fc«)»r. MUV RAM2 5 0ttff^J«S5: 

*5«trm»-rsiBs^x-^s»iwrs. €sscf?^i« 1 6 

fifX 1 6?lJlCtI^$n. £&&©f@ttft*2 0 b> 

MUVRAM2 5 0(4 1 i7 X3y ifti'f i? b&lt*) 
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#^-fif 2 4 6l:J:oT7x7m^ 

[0 3 8 7] JPEG]£ffil:^T, \L^^Lh^-1i^ 
-f if 2 4 6 tt, MUVA77r 2 5 0 Sffi?TMCUS 
^7'M7 7 7« 0 J PEGE»<2liffifa±CDfc*l;: 

tt. ^y;i//i^7rtt**fi652:t39*aiLVi. muv 

RAM 2 5 O01*»ttAM>^7z-X7< yf- 
2 5 2d^<3D7 ? -^€rffit>T»*5i*n5. fe 5 

-l5<D¥ft\*. J PEGMK2 4 1 \zm^t : f- 

$ fz£>\z \f.$ ir;u*-*X if iCcfc o TSt^tH 2 

n^o e^-trju^-^-f if 2 4 6tt, ifjBtsns 

[0 3 8 8] tf^Jkt-tf^'fif 2 4 614, H 3 2 \Z 

«9»ffsfc£*trA*x-*©7*-Vy h^fcfr^o 

[0 3 8 9] HI 0 8\Z&^T, t^-fejl^-tf^if 
2 4 6 DftUfcRWr*. K*-fe;M— if^-f If 
2 4 6tt, CBus>f>^7i-^M8 0 1l:^ 

0, C B u s 7 8 0 1 11^ P-M 

CBus £fe*LT#*«fllffi2 3 5 fcftttSnTV* 
3. \£t7W*-jJj->f'*f 2 4 6fca*^7>H7xy 
^■8 0 2*«*SnT43!K tf^-fc**-if^>f if 2 4 
6 *«i»»fc > Fx~*£A*K >^7x-X 

X-f y^-2 5 2*>SB*r$. , ^7>Hf"^^ 
^-l^Pl'XHU *triS»fc'feyhaft-6PO_ 1 _SA 

iD^x^iaoTi^ns 0 po„said^ 

X^tt, PO_DMRl/^X?OLlfv HCtSiBfifc 
l/X#O^I4PO_CDPl/^X^fc*ftan, A* 

s«f»t»ln*n«. f-^wuv ram 2 5 o \zy 

xymtlbtt, 5r-*©Stt*7-fey httPL_M 
UVW^A^JCioTJt^^n^MUV RAM2 5 0 

[0 3 9 0] *^9>H7xy9 : i8 0 2£J;oT7x 

^^$nfc^-^r>->^;^A*x-^*A^y7rT^fc 

F I F0 8 0 3**JflV>&n«. x-^*f^«8 0 

T^o x-^Mk*«OW*ttMUV7l 4 UX4**8 0 
SfcCAStt*. MUV7Fl/Xft*«8 0 5Si!tl/ 
^X^CfcTf-^^MUV RAM2 5 0, ±f- 
*/«2 4 2, it>feG«F«ft»2 41©£%6*fcfi 
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Ttt, i^rCOBu s^f>?7x-X±TOi«fe««t 

[0 3 9 1 ] H 1 0 9 Kl*5t>T. H 1 0 8 <D^<y > K 
7x^98 0 2 ££0nM\ZBt* t^7>F7xy 
5 L «8 0 2(Ctt. *«/U7Kl/Xt«ffl (I AG) 8 
1 OA^ittT&tK t^7/h'T-^^7x7^J: 

10 *#a«8 1 1 km en**, ^Mwm^s lutTFu 
(Hi 0 8) torasftasUT^o, B§*><ygjR& 

A^ (MAG) -f >^7x-XX<^f2 5 2\Z%Z>£ 
ifcLT^S. B*ff»«8 1 lttB**ft?fc»<0tt 
«««**>i/ri>£. Ctltt, F I FO#^>hg&8 1 
4*fflV^TF I FOOttlBfc^-^U *©8**VO 

$8 1 2141 AG 8 1 0©«f«*SB-»D, A^-T >^ 
7x-XX^f7f2 5 2**U^->-r§#*^7>Klr 

20 ^tswa&fc/vr hsiffirrs/w h-r^-y;w^> 
8 1 6 k *-^j^^>HHar§ 

*^7>Kx-^ti:fefc:F I FOfcttJftSns. MA 

8 1 1I1MAG5** I AGS*«fcOflifeUTSaaT 

[0 3 9 2] HI 0 8fc43V>T> M U V 7 F l^X 
8 0 5f4ftfeSV^<O*^0 ; E-KlfftfP'rS, iin^o 
^-Pfcfc^T* JBlttJPEG (JESS) t-FTT* 
So d©*— KTf 14. J PEG£E«©&»©Aax-* 
30 ««7 ? -^»f^*8 0 4fcJ:oT«a&atl* MUV;\y7 
T2 5 Qlty7}My7ytLT&t>tl&* MUV R 
AM2 5 07FI/XtriE«8 0 5l4, f-^*ff«8 0 
4 IC J; t> T&g $ tiftXti * ft Iftlftf « iCSr * M 
UV/t^7r©7HkX**li!E-r«. MAG8 0 5fcL 

»©K*IHU7HI/XftftjEW4&tt^fc» JPEGJE 
•«©8x8^n-^«»jErr*J;5t»frr*. ma 

G8 0 5t4, MCU*«BHkt— «fi&t5TVi«»-&feft 
5o HUOtt, MAG 8 0 5 fj^froAy^yf >ifW}tf?<D 
40 —011*5*?-. 

[0 3 9 3] #iiOtf^ir;Ux-^JC*ViT, MAG8 
0 514, 4OC08 fc> hRAMCOMUV RAM2 5 0 
fc*»-BHi;7l 4 l/Xrtfc, 400*5-ri»tH*it 

MCUf-^«fel:Al/Jk>7 h$nt^e»Mu 
V RAM2 5 0l;tejft$n§. x-^OfelC^>7 ha 



( 
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4 ?\ *;u-r >^ u j p e g^e- k tc^s v> 

P E GBISh^: - K 1- >T . M A G 8 0 5 (111 0 8) 

a, j PEG&mm<Dm.m$Lm<otz$>{z3 2 hf 
-^muv ram 2 5 oic»ttsn§wt^y-y-y-> 

-irJKP + T, MUV RAM 2 5 0 \Z$&$\'£tlZ>f[z - <■> 
K&3rt£n&o *©4^©A*tfi7-fe:JHCfeViT, 312 

-p&h^Tm&zti&o mi i2H 

[0 3 9 4] »*ii*^ r n-feX*r«to : rA*MCU39«M 
UV RAMK*ifl£ftTV>5IH* »*Ul/yDtXtt 
MUV RAM*^8X8^Py**K*BT. -/KM 

^ttTC^iCioT, Izg^CO^ToMAGS 0 5iCi: 

*t:*v>T, IMrtsnsx-^ttBi l l \z^-?£o\z 

HIL^ntXttMUV RAM^£x-^£&frt&# 
SK#tHT. 011411 £tD£?fc:/ni?x©-0«£ 

#fflLS>^>X^2ftT:fc?K MUV RAM2 5 0CO 

*^wi^ra^ir*;i*6»oii*rai*fcii*aT 

[0 3 9 5] MUV RAM 2 

5 G BK >^-A;^i:^M^^T^+^ v *a 

£bTJBV>6fU MAG8 0 5li^*t^r>aOW 
«ibT*<«k3fcfcoTV>S. MUV RAM2 5 0 
a3 0©*9-^*;H[ft*iryS/aT*. 

J:tf4««t««T*. DMUtai;fcftK^-fe;WB#fc 

£^T, MUV RAM2 5 0A^W1E4lft»*&J6K 

MAG8 0 5*««bh& z.<bmm%nnx^hz\z, 

MAG 8 0 5tt>tWT^S-f >^Wi;P*i«iMS*t* 
7xy?-frJ:fcV^5^qEUIlHWBl/»**WT. »«© 

??T*¥ft0*vbDfc, ^Hfcox^Mjft^xy^T 
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MAGJ4MUV RAM 2 5 OltftttSntV^tT© 
fT*iJ&&£X*^>T3o H«***±tf«a**^© 

jtS*II*5fcoT, MAG 8 0 SteT^O- F7.z -y?65 

(0 9 4) t;txhS*o*iff?dff*t* 

[ 0 3 9 ?! HI 1 Sttfe^T, HI 0 8©MUV7F 
l/Xli:.^;. MAG) 8 0 5 *J:Dl¥ffl{C7jVi 0 M A \, 
8 0 5 I B u s S3c£^m{bT£ I B u s S^ya 
10 -;U8 2 0 ^itK.T4oO, IBus^lKMi 

2(C<fc^T£j££n£. iKBfisftfcL S^^^fTT^J: 

-k^^-^^-fif 2 4 6tt!i«*8L ftSHit*©^ 

S^l^C^-HTfftfPTSJc^^ft^TV^fc*, « 
9MB8 2 1, 8 2 2©BtUM*«I6&V^tl:a: 
£o IBus8* : E?a-^8 2 0tt t ^7>K7x 
^^ffl^©»**t)5ST§©fciKS&^X h7 FUTst 

/ixhfist H»TSh!^-fe;^-^ 

20 -f1f*^»tBT§, 

[0 3 9 8] J PEGfH»«S8 2 4tt> JPEGf^ 
frMmfflt J PEGtt*HiUS»lB«£V><5 2^CD«I8« 
fe**«7LT#tK J PEG^-K^iS^Tffifens. b9 
E-acO*(I1Bffl«^(rff»TS«t5lc&oT*0, ft 

[0 3 9 9] J PEGJE«IMEI::fcV>T, DMUI4MC 
. Ur-^^ttliLMUV RAMfcJMfiT-5. JPEG 

U>^©ftmtftflSUT*0, JPEGWMMHUfcW 
30 «»4Slt/^ >^*aSTS, *¥Af-f >yi4DM 
Uffl**^±TS2li^J:oTfTton, ^SA°x>f>y 
I4KKWMHU Vtz 8 X 8 7 D v 9 fem*R*fflT21 £ fc 

[040 p] ;.p.EQ%ea»tm«it.. v-x,p&k: 

CU^MUV RAM2 5 0 \zm%&$itl?ttg\Z, J 

PEGteaaiMittww^^tty ht$** 

40 ttW)izyh?&Z.t\zi-DTs MCU**lMWX&Xy 
yjC*****«V»l4H*©««Xy5;K**3^**T, 

iitt/t^>y3&«ifeBT**35^1ilft©a«©MC 

[0 4 0 1] J PEG»€rii*«HWBI4DMUta*x- 
?&hyy*>>fL, DMU|ti#x-^£MUV RA 

k2 sofcte^ts. 



( 73 ) 



11-85969 



143 



144 



[0 4 0 3] £TCDMCU**MUV RAM 2 5 0 \ZW 

£&*tittt$\z* ffietunsHMcuinft^j peg 

-RW- I PClo>* Wkl PEGg^ 

[0 4 0 4] IIOfHffllffltt, aS^MCUtfMUV R 
AM 2 5 0(C#Sii*nfcft, SLEEPW^IIAO^ 

[0 4 0 5] J PEG»(*HibM1B«H4* MUV RA 
M 2 5 0(CfeiM*nTV^MCU*^8X8^D7^* 

[0 4 0 6] Z.<DM9t&\t J PEG-RW- I PCfcJ; 
f***Uit5, iilAf>f >^IJMUV RAM 2 5 

[0407] m^mmm^s 2 i«i bu s*>&#- 
*;^>f x^y^*R*mu *-*;k^^smd 20 

P 2 4 2fc£A5. -tbT, po. lenl/^m 
[0 4 0 8] *-*M*^0il*7iyfSWSl 

kik ywmznz* 30 

[0 4 0 9] K^-fe;m-^-9->^5 8 2 514, -WHftl 
#A*7-F<Z>1 A>f Mc#bTlMFr*-'3©Bi;3 t 

tO 4 1 0] MUV^lftt^aHl/S 2 6t4gft7£ 
x^^T***(lB«*6MUVK*Hlb*J:tf»ea* 40 
ffi#ft»K. MUV RAM2 5 0£ 

MUV RAM#£&^7Fl>-;U4MUV£M; 
^^ol-;I/C0 8H-; M/5?X^IC*IWS*ITV^. MU 
V RAM 2 5 0 ftJBV>«ffl««tt2fc©MUV RAM 

to 4 i i j MUV#^7^A%^d-JU ^ 7 life 



RAM2 5 0K#V>T#JfflT£5^*ft£-r5. 
^irA^Ev 5 3.-^8 2 7HH»f 57 KUAfc I Bu s 

-t7SftSi:, MUVW247^1rX^>^-^8 2 7 

[0 4 12] 2 9 tt, P^SBe^ir^H/ 

^yz-i!^OA^7-!-avLT , .^^ 1 A?J 
AMJ-AafflhSftSJ^Kft:*. PBus-f>?7 
i-Xt5?a-^8 3 0lt VLZ&Vit—li^ if 2 4 
6*if-^/U2 4 2£J:tf J PEG}J§K2 4 1 

3. MAGfH®lg($8 3 ltea^Wt:/^^- 

yx-r h r zmn t *, * v v w >r t 

££fi£T£o ftfe* MAGBM83 1(4, if-?A 
A2 4 2£cfctf J PEGflffftS2 4 1*>S©A*PB 

[0 4 13] 3. 18. 2 MUV/ty7r 
B2l;:43Vvrtt, m*T?©RW*^^s^ftJ:5fclf 
^ir;kt-#^<lf 2 4 6ttMUV/t-;7 7 2 5 0 <hffl 

[0 4 14] S3>7^*al/-ya>Bli&MUV;t 
^772 5 0(4W^^i7 7^7 P T— y^-H 

po) , ^a;p^^7^^7 L -> r ;^-K ge-h 

1) , JPEG^E-H <*-F2) ft^trtl^rftfta^e 
-FW*-HTV^. -tn-^tKO^-PTT, Ay 7 

5, HAtf, Ay7rKtettsnTV>£x-*7-F\ 

7>f >3>RftEfeMUVA^7r 2 5 0 fcfett 

Snfcx-^^x^ M4Ay 77©t / <l/ to -f^ > 

So 

[0 4 15] 7<DT-*£mZmLrzt)RZf 

-W?x?h©7*-7yf, Ea.kSex-** 
77*6t*RSn§©^ Ay 77 

to 4 1 e j Hi i %m^>n^i^m^m 
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j^^U-'sa >pJfi6&MUV/\y 7r 2 5 0 HX>3 
-^1 2 9 0, 7H/-5?f/H^l 2 9 3, 

129 1, 7HUXK^i&*- n-x->3>fi#«4 

■SU 2 9 2fr*>ftZ>. X-JlT" ^/h'J-Ai 2 9 5 IC 

h^x>n-y 1 2 9 o {z^o^m^-^iz^mt 

£ft* rt«f-^HU-Al 2 9 6K:EB3ft£. & 
2 9 3tCfettSn§. 

[o 4 1 7] »asnfcf-^t^xi7 h*«*t{tf 10 

-^j'a i^-t^. iV *v '. i/L V "' . 1'"- 

HJ-A1 2 9 7ICJ:0X H/-?7/HX)l^St)Hl 
£ft£o fiFtftf-^lflAX hU-A 1 2 9 71©»§ 

fbSft&x-^ttxn-^ i 2 9 i iz^^rm^it^n 

»J-A 1 2 9 8±K:Sft£. 

[0 4 18] Xhl/-^f/HX 1 2 9 3^0#£&^ 
1 0 3 5HMAG 8 0 5 (H 1 0 8 ) \Z£D$- 
tfM7Fl/X 1 2 9 9, 1 3 0 0, 1 3 

0 1feH«l:MAG8 0 5 (HI 0 8) IZ^-oTS-lLZ 20 
ft, 7 • D-r-ya 1 2 

9 2lCJ:oTX hlz-v^xA^X 1 2 9 3IC#ESft 
3„ 7Kl/Xtt*&#- r—>3>ft#3e*»l 2 
9 2te£ft, A* • Hl^D-T-ya >ft^ J 3 0 3, 

1 3 0 4*X>3-^ T3-m^nc»Ltt* 

1 3 0 6 ^ 1 3 0 7 tt*«7- 
X*54A5n^. 3>hD-78 0 1 (B 10 8)1;: 
■fcoT^A&ft**!*- 1 3 0 2 «X>3-y 

1 2 9 0, f3-^l 291, 7Fl/X|fc*&*- n- 
f-^3>fl§58tSl 2 9 2, XH/-^7/HXl 30 

2 9 3 l:M$n5. -f >^ ij * > 1 3 0 8IJ7 

2). TfcJfV\Sft* 

y 772 SO ?9Tv 7^- 7)V*- K 
HO) T**»&lc»4, *RWfc/ty7r2 5 Ottttb 

-**7*Jx.9 h«*©65^^ ; EU : £^^-;Hc7^-lr 
Xf**tfe t WffetoH*T/X »;7r £ V»tt/t y 40 

[0 4 2 0] Sn>7^f^l/- ->3>pIt6&MUV/\* 
*;77 2 5 O^fcW^Ty^f-^-H ( ; t- 
H 1 ) 7*W)ft$<Dm, 7 7 2 5 0 »4X b P-?f/t 
-fXl 2 9 3K*»aftTVi*^3o©^T'--^ 

***v^T«fk©x-^;ufc»«sft«. fcsRx-^ 
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9 5 0DTffi3/W h**JfflbT<>Ty^X^Otf 5ft 
5. 3/VT bO-tft-PftttA hl/-yf;KX 12 9 3 

^ftjwsftfcffiao^^-^i^ffsft-s. 

[CM 2 1] Hffii^J PEG^^iTl^t^. 
tt«F^{bSftfc?-^XhU-AlC**$ft-5. t^-fe 

;i/«isiift*^Mcu©7^--7y hT*Da^ft^c 

^ft^ft©MCUl4*Sctf>8 X 8 07 n y # £33^;* 
ft^o £*0>8 x8^n«^UMCU^e,jitia$ft^ 
MCU«Siffl«CDA7-a>7R-*>h, Wft^-r*^ 
j r l, G-t — r , 'J ~J'"-y s'j <j ^ .-■ 0^ : -#ii#Ol^< 
■p^©SHtC&#bTV^S. 8 X 80D^D^i7tt-tfl!)a 
7*7— FDCT (FDCT) , tt^fb, X>bDtf- 
^{b^ftS. J PEGEfigtDtt'&KWU fl^fbSftfc 

£ft£. x-^XbV-At4x>bnl£-S^{b, £1; 
-T{b, j£DCT (IDCT) fltfffeft*. IDCT&H 
©ffi*tt8X8©^Dyi7T?*S. #ftfi!)8X8^D-; 
^ttMCUSff«rt-r««fc5JCitt'&3ftS. JPEGJE 
#fft©8X8^ny^ttWS605H«c 

77 2 5 OteMCU 8 X 8 ~fUy^\Z^MVtz 

0, *SfcO8X8yayi7*MCUICS»*bfc0-r«> 
t^lCfefflir^ftSo 
[0 4 2 2] S3>7-<*jLl^-v3>pIflg&MUVA' 
*>7r2 5 0**J PEGqE-H^fflaSffoTV^t* 
tt/ty 7 7 2 5 0 b U -A 1 2 9 5 tt 

J PEGffi»ffla*froTV^t:^-t;|/*§V^4J PE 
G£E«f»a*ff o T > b ft^T 
Ky7 7 2 5 OCIBAf-nb'J-AttJ PE 

G#fiffla©*— ?v*;ux-^^n^^*5Vitt j p 
EG^fiffia^if^-fe^x— :©jp 

EGflEI60«-ett, A*t!^-feM4Y, U, V, 0©4 

So -€-ft*ft©*-03>*-*>hx-^^ny^»4 
/Xy7rfc«Hlft*ft&ra?ir*^©K^*^6&*'r 

-^«cj:-o«ij*aft«, fctc-cftrnt 4o*t© 

n>7^^^^-->3>Wffig*MUV/X-/7r 2 5 0** 
J PEGJEEJWHGM PEG*-H ( : E-K2) TMS^r 
froTV>*fc*fcHU M^#fiM3-F (MCU) 
J4^ft^ft6 4CD*-$-5VW41t&5 l ^*;KOt!i7-tr;U 

feyx^7rfc*WTe, ^Sc(0 6 4;whS©*-5 1 ^ 
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Y, U, V, 0©»*4^©n>3R-*>h*^ffi* Hi>a-^1 2 9 0tD-f-y3>t5©(!:US© 

rccht^, br^t;i/x-^©J4aj**T€r5, gfcsfe© ffi&4^oaso^{bx-^^5t>^i^*a«t*s 

T-^^UypfrZlD/W htettifiVZ-tJltLTMO (3, 2, 1, 0) (^t^^Wttt ^nx^£ <ht~ 

*£tt£„ St, d<Dtf^ir;U(DD-T— ya XD'fflgtt (3, 

[ 0 4 2 3 ] 0 1 1 7 teB 1 3 5 09 r . -3-^ 1 2 9 0 2 , .1 , C (2,1,0,3) (1,0,3,2) 

Oi¥fflEIl?$>5. hf^ir;U7D«y^^#fi(7)-e«, A7^ (0, 3, 2, 1) ©4^A<pIflgtT*5, HS*D— r- 
x-^7>>x^ hfn?nttXH/-yfMX 12 10 ya >fiHBffi-t 1 3 0 4 ttAy 7 r ^e^til//D y £ 

o^^fb^ns (hi 2 9) . d-x-v3>cda^$ ^v-v3>ffl3&^fisnfct*ic«fflsns. y-* 

HA^P-f-v'H^Bffl^l 3 0 3tCJ:Dft^$n >a>«l 3 3 3tt0T?»5. 2A^lffl*07 

So ^^JT^lfi7ir^7 :: "^^SAo4;W hT*o ^l/^^13 3llt ffi*n— =r—>a >$»J®ff^ 

fc£#fc, 3 2 t*y hCD4A* lffl^OVJl/^yW^ii- 1 3 0 4 t7-*^lx-y 3 >I1 3 3 3©il#?£fT5 
1 3 2 Ofc£tf 1 3 2 5**, 40©35©l:3CD?JKfc 3 3 4££j£T6o SftCOfflS*:- F 1 3 

A*h!i7-fe^oD-5r-^3>oa*?»cffiv^n«. « o 2\t^)v^yu^sm9m 3 3 2*&«-r<5&» 

t)lWtW4^WHh^ (3, 2, 1, K> t!^ir^7 r D-/^^^-K^filtit|!i$nSo ft 

0) ®&5\Zy^)],1fifflft>tlT\t*1tt'$'Z>t* £©t! ^1 3 3 4©«tt>T*llBISnS4AA lfllA©^^ 
^-tr;K7)D-7 L -v'3>^ (3, 2, 1, 0) (0, 20 l^tfl 3 3 0 tt^<bStlfcffl*x-^X h "J-A 1 

3, 2, 1) (1, 0, 3, 2) (2, 1, 0, 3) t 2 9 7 ±<0^ftStlfcffl*x-^*y vx^ h CD 4g 

4^^Uffc$nfc/H MtX M/-yr/HX Son— r—> 3 >*»*?U ffl^x-^Ah'J-Al 

©12 9 OKffiT^nS. 2 9 8±ICffl^x-^^^-TSo 

[0 4 2 4] A'77r^J PEG^-H ( : E- F 2 ) K [0 4 2 6] HI 1 6(^4oV^T> EIKTfflV^nSrtfiB 

(t-FO) , ^fi^^^x-r/Jl^-FT?** a >^JIS:MU v;ty 7 r 2 5 0 H 1 3 0 2 

-**:/$?x? htt«#o»^lc, y-P-r-ya > HI) Ttt»-M&#7 Fl/Att^»K*&#7 FUA 1 
©«ftt>^A*n--r-^3>*IW«#ft*a"r*2:t 30 2 9 9, 1 3 0 0, 1 3 0 1 WMAG8 0 5 (01 

fc«koT, D-r—va >K<i;D*fiS*£W-5. £©fll 0 8) fci^TtriSStts. Wy^Ty^r-^ 

1 3 2 3*4T*So 2A*ltliA^i/^i7^13 q&- F (^-FO) T?I47. F l/-^f/H A 1 2 9 3 ± 

2 lttftHWt^l 3 2 6£A#D-x-V3>fKflW^ fc^^UWa-JH 3 8 0, 1 3 8 1, 1 3 8 2, 1 
1 3 0 3t/-^<U-z^3>mi 3 2 3©S^^TS 38 3, 138 4, 1385 (0 1 2 1 ) fctHtiCffiS 
^fclCkoT^-TS, SftOftSt-F 1 3 0 2 -T*. ^»Jt?a-^1 3 8 0*6l 3 8 5 (012 

y*^*-F©«teJfc«3n*. o fi^l 3 2 6ICJ: HttfWSfcHCT**. Wfc, XFIx-yx/WXl 2 

t>T«fi3n6 4AAllH*OV;i^ l ^l/irlH 3 2 0 9 3 tt^«H*fc 1 r?©«|*a*7 Hl/X t 

ttA*x-**:/5?xjr h(04o©D-f->3>05 &*7Fk*©ft»©#ftjfc»fcU £n&©7 Fl^X 
tl^ftWJU WWtanfcXAx-^Xhy-Al 40 ^tV^a-JH 3 8 0^^1 385 (01 2 

3 2 6±lc^{k$nfc#*x-^^> J x^h*^ 1) fc»B-r«fc»Krt«n^y^t«Bf«. 

-fs. 0 -ctt, K*a*7 f vjMsmr f i^a 1 2 9 9 m 

[0 4 2 5] 01 1 8 tttt^fcSJTl&fflAx-** HJ 116) CfcD4*.&tU *Jlttfc*fcL&V*i3;rt« 

-Al 2 9 7fe«#ftr*7 ff 3-yi2 9 lSHS-r* 7Fl"X134 8 (0121) fc*>fi;Sft*. 

tt*£;btt0tt©0»BT*£. r3-yi 3 2 lttx &#TFl'X 1 3 4 9, 1350, 1351 (012 



>3-^t*«fl5lfcH«a:*ttT»fW*. t3-^ 1 ) tt*-.F 0 TttffifflSft&V^ #S&*7 F I/Xtt 
f-?/V;7r^J frfeG^-t* (^-^2) T?&3*§ **#*&*7l*l/;vi 3b5 (0116) fc.kt)#;L 
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[0 4 2 7] CCr\Z, grmfryZTy-ff—*?)^- 

tt^itZntzXjjT'-Zte 1 3 8 0:^b 1 3 8 5 CHI 1 

2 1) ST^T^TCO^^O fe^j.-;KcraB$t^*#ii 
in, ffot 3 ^©f -^wn^fti: l >f y 

t^'J^v^-;!- .1 3 8 0*5 1 3 

^7KIx^ttx KU—^x/U^ 1 2 9 3 KJ;0 4*-5 
n&, £n5®5S*&*7 Kl'Xte, f^StSn^y*^ 10 

#ffi£n&o *»Wic*-;Py^7^^7 1 -^;u ; e-K 

*K«&*SE&WC1 3 0 8*5 1 3 8 50 

^n^no^ ; tu : t> f a-^®7Ki/xir^$n^o 

1 ) Tf »4^««*ii*7 H l/X 1 2 9 9 f 1 3 0 0, 1 

3 11IM1^M7^X 1 3 4 8, 1 3 4 9, 1 

350 tc^n^n^K^nSo rt«tt#&*7 i 

3 5 2tt : E-K lTfttttfflSn&Vi. JPEG^E-H 20 
[0 4 2 8] H119UJ PEGJE»£fr? J PEGt 

T* yn >ffl^*ri6i§i» 1292 fengrrsa^to 

■fr@»©|ElgSHT*S. JPEGt-H (*-F2) T 
te, flt#t*»l 2 9 2tt3>tf-*>h;fr0>*l 3 
40fcx-*/Wh#£>*134 1©tH**, Xhl/ 
-?f/M7 1 2 9 3^tS^^)-^JzL-)V0\HU 30 

«*a*7Fi^x*ini[r*fce(cfflv»Tv^. n># 

h ?U y 2 % V y$ 1 3 4 0 ttX h 

^flttt^-*/W Y*«?>* 1 3 4 l<0ffl;*j£4ffirr& 

^E-HfcSW-S rt«tt*a*7K^l 3 4 8, 1 3 4 
9, 1 3 5 0, l 3 5 lmto&vKWr&znz* n> 
#-*>h>^!3y^*»>>^l4#7-fey HB1 3 4 3, 
1 3 44, 1 3 4 5, 1 3 4 7 MWt^fcftfcfiWHS 40 

n, i&m^x-^/u 1341 a^-xi* 

*ii*7FWX 1 3 5 4**|fcf Sfc»fcJHV»5ns. 

hil 1 3 4 3 tt^-.Xtt*&#7 F 1 3 5 

4 fctmran*: 1 3 5 8 t, mnmmw&fr&^T h 

1/X1 3 4 8 (*5W41 3 4 9, 1 3 5 0, 1 3 5 
1) ^tyt^a-^tyHll, £fi 
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ffl-*-rSOtCfflVi&nS^-X7 Kl/X 1 3 5 4t>H« 
T&£o <>^U><>hfl^l 3 0 8ttn>3ft-*>h 

Mf-^1 3 5 6H #-ttiEfflSx-^yDy^*«iE«' 

K A *y 7 r * 5® 0 Ml $ nfcS. 3 * > 7*n v 

C 7 3j ■> > > i 3 4 „■ ■' > ^ "J > > r ,-; ■. ^ l 

[0 4 2 9] ffl^O— r-->3 >»J1Hff^l 3 0 4 (HI 
116. ^3>#-^>h7D^^;J : ;o^.;..:i J ^c!:ai 

F#£>*©ffl;*7*>5ffi9ai3n, 
tcrt«B7Kl^X©*riEi:Hi;*ftT*5, n>#-*> 
hyDy^*r>>^(0|lJ*ttD-T-*--v3 >*7-fe^ h 

1 3 4 7$:n-g:-r^®icfflVie>n^o m^n-T— v 3 

1 3 0 4ttD— r—>3 >*7*fe^ h 1 3 5 
5 i^-XK*&#7 H l/X 1 3 5 4 ©SlO«Tffi 2 t* 
y htr«tt)#ASn§. A*D— r-ya>«»fl^ 

n§5»^^^7 Kl/>s 1 3 0 5 OftTffi 2 bf y h 
tcJ:i94^en^o 

[0 4 3 0] H12 0H Sa>7^^l/»y 3 >pJ 
KMUV/V>7t 2 5 0tC»iW3nfc¥-3>*-* 

«CffiV^nst>5 lt)©7 2 9 2T& 

A?;7 7liJPEG#fi0ft»©JPE 
G^E-H (^-^2) ^CO^, ^-a>7jf- 

*>hx"^yn^^^/^>7ric^$n, hf^ir;!/ 

X"^^D>>^«n-;7r*5®0ffl$n§o 

te, y<^U^^-;^coS^jX^7Kl/7;^ *MW 

«MftLm*»*a*7K^ 1 3 0 5\Z^X^K. 

3 0 3 «mft*€fji^7 FkX£>ftT&2 IJy hfcto 
T-fe^y h^n^o K^-fe^*9>^ 1 3 6 Ott, 

am $ nfc tf -fe;i«oB» * fc» cm n 

&o lf^1r;i/*^>^©tti*tt, ^a^7KWX13 
48, 1349, 1350, 1 3 5 lStfHt*D-f- 
5>3>fMffliflHf 1 3 0 4^nfe26l:fflV^tl^ 
HRfcH»&»7 Hl/Xtt> XHz-^t/WX 1 2 9 

tt*a*7Hl/Xtt#-3>#-*>h^ 
DyM>f^^l 3 6 2» 1 3 6 3, 1 3 6 4, 1 
3 6 5&£Htel 3 6 5 ^/W h-f >r;W 13 6 1 
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*.)WOy$<to\Lv Y2^Mv F0tt»*&*7Fl' 
X0/Vf Y4y?y$7s 1 3 6 lfcfflnens. 
^7 FL'XttHl 2 0\Zm2tLZ>£?\z. *-3>tf- 
^>h7*D7^-1'>T7^X 1 3 6 2, 13 6 3, 1 3 
6 4, 1 3 6 5i5^il 3 6 5 t/Hh^>f7?X 
1 3 6 10&&G>J6S-e»5. ^OfiHJTtt, Hi^n-T- 
->3>BBil^l 3 0 4ll *««&£{tfcUfct:£ 
Wti0yfr<Dlii1W\iy h4t¥y F 3 K J; 9 

ns, -f>^u/>hfi^i 3 o s&vpwjjvy? 
i 3 6 o&oi7'j^>h-r^fc«>oif^-fe;i/* , 5>^ 

^ 1 3 6 0titfi7-feJP^iE«tCAy 7 7*^810 fflStl 

[0 4 3 1] HI 2 IttX M/-^t/HX 1 2 9 3 <D 
flBTCSS. XH/-?t/HX 1 2 9 3H1 3 8 3, 

1 3 8 4, 1 3 8 5©30©4tfy F7-f F^U^i? 
a-JI/tl 3 8 0, 1 3 8 1, 1 3 8 203008^7 

0) CO 3 6 1:7 h07~F\ Mil"^77^f-^ 
^E-F (^-Fl) 012x3t^h©7-H, JPE 
G^E-F ( : E-F2) ££#£3 2 hcoif^-fe^* 
5W44 x 8 fc> ho*— n Fr-??:^ 

;Ptt«F^fb$tlfcA*R^ffl*r-^XhU-A (12 
9 6 <h 1 2 9 7) fzt 
X.tf, ^tUWa-JH 3 8 Otefr^fbanfcA^x 
-*X HJ-A1 2 9 6 0My FO^etiy h 7 Ic&tt 
$tlT-^ A*#- h fcWF^flsSnfcfflA^-^X F U 
-A1 2 9 7©ty FOjfcSK? h7fc««!*nfcx- 

$mt§o -Jj, hi 2 wz^znzt^v^y^- 

;p©tt*a*7FWXl 3 8 6, 1 3 8 7, 1 3 8 8, 
1 3 9 0, 1 3 9 1 Httt»&»7 FVx**B 1292 

en, »z:o*3ifl[)»*a*^»^tt-r^T©4ify 

a&icfflv^en^o 

[0 4 3 2] HI 2 2ttXF^f/H^ 1 2 9 3ft 

u i t^a-;wK:7^-feX'rs&»oK*.a*7 f 

1/X 1 3 8 6, 1387, 1388, 138 9, 139 

ffcsnfcfnfnox*f-^*^x? f««»« 
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Utya-^ttM7 FUX6tmcr5fc«)ic** 

ttKD^£a&K&^. tt*a*7Fi^xaa 

j&sfiKgfctK ^E-n-enoffla^-Ftttsw-sy. 

*al/-ya>Plll&MUVA77 7 2 5 O^^r- 
^X F'J-A 1 2 9 SrtO-T^TOAW Ftt J PEGJE 
SgOJPEGt-F ( ; t-F2) w\7 7r(:fei^n 

T^5t^i:jux-^^ej*iii$nfc*.f43>#-*> 

Ft-^» *5W1J PEG#fiO J PEG^-FOA 

io u/7rrt»c»ittsnT*-3>7R-*>hr-^^ejBi 

ffi a n fc tf -tr ;i/ x - ^ ^ * 7i < ' : /i & i- B £U 7; 
^-t? iztt? 5S*ttA -;77^40 ©tt*ii*7 F 
I/X 1 3 4 8, 1349, 1350, 1 3 5 1 O^figlC 

(t-Fl) fcJS^Ttt. S*3O0«*f-7WA 

1/X 1 3 4 8, 1 3 4 9, 1 3 5 0^3OOMf-y 
;HC-Y>X'y^XftOtf«fc»fciKBTf*§. t<TO 
* * U ^ >^ -;KDtt*i&*7 F UX tt#-^ 7^77 
20 7^- yOl-^-F (t-FO) CDS^fcHDTfcD, K 
^&#*7 F1/X2 4 SO^CO^-FTfflVien^o 
0122 KSSnTV^IMWlHllSoWttX F l/-^f/^ 
-fX 1 2 9 S^Mt^eotD^^U^ya-i^n^ 
n©tt*a*7 F l/X 1 3 8 6 - 1 3 9 1 &§|-fff £fc 

7Ft^X*fflV>£. tt*a*7Fl^XtriE»l 2 9 2S 
A*«^£bT*V«7FPX/tX 1 3 4 8, 1 3 4 9, 
1 3 5 0, 1 3 5 

K X H/-^t/HX 1 2 9 3^^^^';^ 
30 a-;KOrta5tt*a*7FWXl 3 8 6, 1 3 8 7, 1 
3 8 9, 1 3 9 0£4j£-r&. 
[0 4 3 3] HI 2 3W*y7y 2 5 0 JPy £ 7 

7 F©frH*fcWXy 7r 2 5 0 £«M3tt*0ft>fti* 
UfcBT&S. £0»3\ r-^t^x?F^> 

x>3-^>iraai«fftona:v». fr5Mi«tt8Ky 

F^€U^a-;H 3 8 0, 1 3 8 1, 1 3 8 2 \Z& 
?a-;H 3 8 0fc*»StU tfy F 1 S^etiy F8 

et> f i 6\t**y*z?*-)iri 3 8 2oT&4h> 

;i 7 7 r nfcx- * *:/s?x ^ f ttwia t>a 0 
aan^o #-jp^^7y^T— ^t-Ffc*^*, 
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-eno^Sx-^Jua 4ifyh co< ami <h 8 1: 

^^JzL-MzW^ftZ* :^30^r- 
7>1 4 10, 1-111, 14 1 2l^ : EUA>^ 1 3 
8 0 t 1 3 8 3 , • : : 8 I .... 1 3 8 4, 1 3 8 2 1 3 

8 5 \zfe%H2tlZ> 0 #«i&£fc*Wa*J»«^ 13 0 6 

> ] 3 0 7 (HI 2 1) te7sh\/-i?T/U7,\Zfem2 10 
. . ^ jv&foL . ... :X r iz-vr^i'A . .. 9 
3 \Z-i >^-/\ + JH^t^^Ou^^^. 

[0 4 3 5] HI 2 5Hh!^i!Ji'T-?yP'^^- 
Bfcx-^ny^fcSHrr* J PEGt-F (*-F 
2) ©ttJB©B3>7^*i^">3>nIflB&MUVA 

*ns©^*SbfcHT*§. X h Iz-^r/WX 1 2 

9 3tt, 8 tf^y h^^U^v^-^^l^m^i*^^ 20 
UTftbn^^^U^^a-iK 1381£1384£ 
^fr^U^v^-JH 3 8 0, 1 3 8 1, 1 3 8 2, 

1 3 8 3, 1 3 8 4^565 4*3C98 tfy M^U/O 
^<hbT&cft£n<5„ y^Hy^H 3 8 5teJ P 
EG^E-H (^-1*2) TttffiJBSftfcV*. Z2¥yY 

[0 4 3 6] HI 2 6tt¥— 
§X 1 2 9 3 Ki'OctifcbT*-^ 30 

mr&Z>. 7M/-^tM7 1 2 9 3(4, 8Hyh* 

rE^a-ik 13 8 1^13 8 ^fr^D^^U^^a- 
JK1 3 8 0, 1 3 8 1, 1 3f 8 2. 1 3 8 3, 1 3 8 4 

*. ^^EU^a-JH 3 8 5*4 J PEGt-K 

H2) r«t«Bsn&v^ 3 2tr^h©«F#ft;3ftfce 

^ir;H44^©/H MciHlP Stu **H?hft*»&:* 8 

§ 3 * > f y D v ^ im.n yyy \zm$& * ft* 
jgfflSns. 3 2lfyh©fff*ft;StlfcK^'feJp7 J -^ 

to 4 3 7j J;biW&Hd>7-f*il^— >sl>^« 
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*^**;kz>fe3fiB*« (csc) tc*ws*8k<o<> 
^-/\W77^ya >^7™^*/^y7ri:» 

J"PEGjE», J PEG#^tl-entt4oV^T, MCU 
7-^^8X8 ©iji-O >h^U^ 
*5W4 8 X 8 ©J£-n>7R-*> h^D^^^MCU 

[0 4 3 9] 3. 18. 3 ;tf:M-!f 
MUV/Xy7r2 5 0ttJB**-^-flf 2 4 9JCl3Vi 
TfefflV^sns. «**-#:Mlf 2 4 9tt, 
-^/U2 4 2*5W4J PEGH-^2 4 1 ©X h U 

■!f2 4 9tt*&, B4 2TK9!bfc**x-*©JEIfr 

H«f«. EfclS*:*— #:J-f1f 2 4 9(4^SK>^- 
7x-X^>hD-72 3 8, n-#;M^lJ3>MJ 
— 72 3 6, JS2H >^-7i-Xj> hD-72 3 7 

[0 4 4 0] J PEGfr^E-HCDBt, l6«*-^^>f 
if 2 4 9ttMUV RAM 2 5 0^J PEGH-^2 4 
9<DH^X-^^^ , /;PA7 7r'r^fcfelCfflV^o ^ 
^JWy7rl4MUVRAM2 5 0 *h 
TV>£ J PEGa-^2 4 ! C0x-^£/f!VvtJ PEG 

[0 4 4 11 1. 3'"2ttf4^**W«5 s -^l*,"«- 

?>*Jk&>&©8 If? F03>Jp-*> h£^tt8 X 8 
?Uyi?G>J8<DJ PEG#*ftffoTV^Bfc, US** 

-35T^-f if 2 4 9 fc*sn*. »**-;ff:M-Sf HJin 
S©:/ny^£ii£<9JS#TMUVRAM2 5 0 

7-^«UVRAM2 5 0i^6tt*^**ffoTV^ 

^©y^^jKo^^S/jLiftHft-r*. «AS/yuvi: 

ct-53^^^C0J PEGJE8STJ4J PEGH-^2 4 
n43:2<z>8 x8:/ayi7£, IB»fcY, &K:u, 

4a, If^-fe^ft (yuvx) <DB-?mffctz>. ZZLT?X 
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[0 4 4 2] HI 2 7(rfeif5tH2CDiBS*-^< 
If 2 4 9<0i¥iffl*^$ttTl>£. te£*-;tf;Mif 2 4 

;U*^Oa#CD«*CBu s<>^7x-X8 4 03 
fflKSWfcfc^T^S. «g**-^-f+f 2 4 9C0ffl.a 
tttf#-feJk*--#:M P if 2 4 9 <fcH«T*§!^ «JA- 
Ax-^&fl^fTtonS. x-^&ffOLriy h 8 4 2 « 10 

*-*>h«*. #IE«fcftMUV7HW;**£S8 0 

ft*. FIFO+a-8 4 3^r^^fft^RB 
usBWPa^y h8 4 4*MViTtBAStl*Wfc/1y7 
r£fr5, RBu sfWIJJX-y hS 4 

AT, ft»l/?X^l:**!!Sn§. Efc, rtSBRO_C 

ftl&T, RO_LMTlx5?^^am*«|SAt+itS 

nfcftojfc^-^sffl^Ta^vK^ox-^JiHA* 

ffl*SnS^**fif§. MAG8 0 5*4 J PEGM 
WFfcMUVRAM2 5 0©7Fl/X**riEfS. MUV 
RAM2 5 OteJ PEG3-y*6©flJA*^;Wty 
7 7l~5fc»fC/HV>Sft$, MAG 8 0 5f4F*?IS5n>7 
^*al/->'3>l/> ? A^iC&#-r^MUVRAM2 5 30 

*ff5. /vr H/->x7yi;>wwt*T-^t 

•<Bte»*fc»ISr*»fcMfca:*©T!, MUVRA 
M2 5 O^SWStW&x-^tty-^Ji^any h£a 
bT«£ft£o »**-^-f1f 2 4 9**J PEG*- 
FfcfcoTV>&V>i#ttMAG8 0 5Wl;PBusI/ 
->-A 8 4 5 ©7-^f-?ISfflXy h 8 4 2 fc^ 

[0 4 4 3] 3. 18. 4 ^7>F;*-:*f:NfifB 40 

H2fcWtfKoT* 2^©a4«t*'<9>H*-jJ^ 
if 2 4 7t2 4 8ttf-^*f^>a3>hO-*2 4 
O0T-£V1?7r©*MBi, JPEG3-^241$ 
§W^f>T-^/U2 4 2fc-r-^ftlia-r*«« 
*^5>H*-^-fif2 4 7t2 4 8tt«4r 

(b) gfc<Z)f8H*0T-*fli*^7 > F l^X^Oft 50 
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( c ) t-Xi-f If ** v-y > v ;P7 F 
l^X*«t«T-^*f ^ 3 > h D-5 2 4 0(O/\'7 

*-F. 

[0 4 4 4] *»©^<>r-^U2 4 2©i!li : E" 
F14, tfe(!:-6?»36^^7> F -f if £ 

v-^>v*;U*-FT*£;i<h£g4tf5. ^^7> 
H*-^-fifB 2 4 7ir*tfs, £)fi££tr;ift'&<0 

lf^-fe;PTiKBT*S. 3*-^7>F*-#:M'ifC2 4 

8 ii*n^n©x-^^^*;u©«©^&ff 

-#:MlfB2 4 7(48 fcfy h©ffWfffS:OAy7 7* 

F*-#:Nfif B 2 4 7 i4SiKttm£:3m«^*^con 
•ft (Dr- fort y7yiff*>. 

(d) JgffqE-HTrtt^^^H^-jtf^-ftfBtt* 

* fc J: o Tl§£ $ tifcHSfcHBT - £ *tf 5 . 

(e) FTtt*^7> F*-#:Mlf Btt 
t!i7"b;^-f 7 r £fr7o 

(f) *^7>F*-jtf±-f1f 
ttr-!>tt!y ya3>hD-7lIMDP 2 4 2&£V> 
t4J PEG3-^2 4 l<07HkX^y-f U£ MClESt 

[0 4 4 5] rt«fiSt^X*tt, ^r>v-Wh * 

F^aa©***:* :t^>F*-# 

t^if2 4 7, 2 4 8©fl*T4riESft^JSi©*** 
Jfcf*. *^9>F*-*^'!f2 4 7, 2 4 8-WHf 

ft«u «^n^Ticj[ts^nfcx-^^aois^«^ 

>#-*>F©ftA, EE* * #£E* ■ lE&fctatB, fcfc 

v*uTJ:o«i*«**. Manfcuanrtai/^x^ 

ftMViT3>7^*aI/—>3>Stl*. MIC, *^7 
> F*-#:M if 2 4 7 £ 2 4 8*ft^tU4x~*JSi 

ftiwpirrsfcfcicn >7^^i/-s/a >2ft3„ 

[0 4 4 6] 012 8Tt4, > F*-#:Mif 

(2 4 7, 2 4 8) ©J:DB*lft*ri»«iSSnTV^*. 
*^9>F*-#:Mlf2 4 7* 2 4 8»4a#©««PC 
Bus^f >?-7x-Xit^7> F*- if£# 
OW»*a]-6 1/^X^8 5 0**tf. Sfc, OBusffsJ 
ffllan^ h 8 5 lt4x-^*^^x^3>hn-^ti^ 

HUXt^ *^7>H*-»^-f1f2'4 7 f 2 4 8<D 

obu s< >^-7x-x©afir*?jKfc-r*«««* 
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x-f1f2 4 7. 2 4 8*<y-y>yt;k fc-S^tt^-T 
^t-^TSStSCH, OBu s3>hD-73.--; 
1-8 5 1 Bf-^OMJT-i' + t 7 va3 > h D- 

[0 4 4 7] ^E-n-C ! no*^5>K*-^^-f-!f»4a[ 
tc. fli^fct-Fcj&aicfcivc. f-?*fr>aa 

>hD-724 Ori^©^-^/^ 7 7 Ac Steffi 
6 h'y hifB©F I FOA-y7r 8 5 2 

[0448] f-»ar7l-8 5 3li, bf^-feJM- 
2 4 6 ©x-^t&ffCi- «/ 1-8 0 4 t^jST 

[0 4 4 9] ^-f>T-^U/JPEG3-y-f>^ 
-7X-7.8 5 4«a«^a ; £-HlC*5ViT^'f>5 ; - 
^AX^J PEG3-y*i?a-^2 4 2, 2 4 17^ 
9<fc0Sn5x-*;&tf7h*kX£#E-f$. MDP/ 
J C-f>^-7x-X8 5 4(S^-^SIf^rL--; h8 5 
3 ^ Of-^^^-f >x-^/1X&tf^©x-^ 

O^lCtt. n - <;/ H 8 5 1 , 8 5 4 ttx-*** y v 
a3> 2 4 0 t$&& ; r-'?)l><Dlt6T&7iT±7> 

[0 4 5 0] 3. 18. 5 ±x-^AXaJ 

WT©*«09©*«att, ttft©m«afif?eftarcfT 

3 £ £ ©Tr£3&«&©=i > Mjl-? 7-*t>? f-* « 

*fr7«fc7»c$j5fcsns::£©T#3. *Hctto*-53 

> tfi-^ 7-*x* x V SgflirfS £ £ & B wt-r 
•6. ■«7*ti-fey»Hi. mtu*?y?&rz<2/u 

•5. 

[04 5 1] a>ga-£7-*?*?*ttM*l'5tt 

ytK 43.fctf, 7D-««a^y^*a«-r-6. Ml/ 

f-^T^t/ h-f>*-7x-.X A73 
K>^-7i-^ttf-?*^i 

Vs.* h-i >*-yx.-WT-9*7y3.i; boy H 

^fe^-rsitfefto, zmscx-^tfT^x* 

>oy-^!4*iEtV>T-^^7*^x^K*JlttTtS«k 



10 



20 



30 



40 
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WSPnv-y^fct x-^tyyi^ Y®MUi>y!7(D$> 

[0 4 5 2] mz* x-^^7*^x^ h7u-ty*m. 
VK-D^©(^-?S:x-^^-7*vx^ h-9-7*7*n-fe- y -t)-S: 

7"-^xi7 h©-«B5*aia-r*. T-9*zrvx# h-tJ-7* 

mmnmm t'&mm& t **«rr s^fifc uvvz 
fc»©aM««£Aflrr*. ^©fHiiggtt. 7n-« 

-5. 

[0 4 5 3] fc*. flreD??£*&©«tj£flrettfta 

TfTS. IO>*-XA*II;t, £©.fc5J&:*«ffi7* 

fca«7*n-t:-yy-*«*-r-5ri*oitBc-r*. *fS9J 

[0 4 5 4] B2C*HT. t9j£Lfc.k7 1;:. £5*-* 
/17s BB 2 42IJJPE Gf-^^§ft;W©iTC9f- 

*k. -fesra^ft, frw©*ff. 

V«; h©#a*^*n-5. ±x-^/t7x 2 4 2 tetf^-fe 
^-Jtf^-flf 2 4 6**tf^5>H*-^-f-»f 2 
4 7, 2 4 8*6. tf^-fe;k£^7>Hx-^i:*S 
WSlt), te*HJ7jSilSft*-^7-f-tf2 4 9»tSI«. 
[0 4 5 5] H12 9tt, =£f-^/Uf2 4 2©7*P 
y^HT!*-6. *x-^/^«2 4 2 WtJHffl©B«7*D 
ty^tioT, A73-f >^-7x-Xl 4 6 0. Bill 
x-^T'D-feytf-l 4 6 2. #^7-HW^^l 4 6 
4 . #^7- PftJfS 1 4 6 8. tmfl|^l/9X^ 1 4 
7 0. W^77-fjH 4 7 2, ROM 14 

7 5 

[0 4 5 6] tiritfflmB2 3 5tt, 1 4 5 4SSI; 
T. #^<7- H 4^7- K l/S?^^ 14 6 4 

^■n-en©#^7-Htt. nfr-r^tBUfflaiifp©* 
s^B«iffla»^©«i>!r js*7*-> a > sa^-r « 7*? 9 

ftS<E)1im«#A/ > VV»«. #^7-Htt, /tX 14 6 5 
*«SftbT^7- 14 6 8 fcg«n-5. **l 

T, #^ifel«fiB2 3 5 t4. i^^H.HSfi^si^fe 
**»7-lMitfHfei 4 6 bl^w^icfcAtT**. * 

©ik^*SiT* ti^Mi 1^ .6,8 tit^&7-K* 



50 
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X 1 4 6 9*»4bT«(»«#l/yx^ 1 4 7 0 \ZM\i 

n§, -ttus^, M»ffl^i/^^oia*tt* ax 14 

7 1 ^lTAM>^-7x-X 1 4 6 O&cfctfl! 

^x-^yn-try-y- j 4 6 2 trttsaans, 

[0 4 5 7] if-?/UilI2 4 2£cfcDf£#:14©$>£ 
^©I't^ctf)!;:, t^0HM2 3 5^fflff^l^vX 

^ l 4 7 OKSJgtf^iiifr <hfcT££- cnicJ:^ 

*>, ±x-^X«B2 4 2©ffl*^«^*fr^S«t-5«C 10 

[0 4 5 8] ^a©H«fflaftfp**fT-r*fc»^*» 

»«2 3 5«, ^©JR*T*&V^»3&:^T©»*S^ 
^^7HM 4 7 2©VKO^©attftfcl/i?X^ 
fc»*atfClt*«T?€f«. A7 1 4 7 3S 

*££UT, AM>^-7x-X 1 4 6 0 t5J:rXli«lT 

ffMc43V>T, KM >^-7x-X 1 4 6 0 20 

vrztz, #^mi»»2 3 5tti(Hfi©**a*«v^T, m 

[0 4 5 9] i^7-F©**W»*7U 

sb 2 3 5i4±7 s -^/^«2 4 2\zmmm&{&wmft<D 

A^^f>^-7x-7 1 4 6 Ott/tX 1 4 5 30 

>^7i-X 1 4 6 011 *frStt*MUft8»ff© 
SSlOSbT* *^9>F/tXl 4 5 2Xtt*'<5>H 
/U14 5 3*6©*^9>Hx-^*S»a0»»« 

9>H/tXl 4 5 2X«^^>FAX 1 4 5 3*6© 

—7s 146 OH tmti&UVX* 1 4 7 Ocoffl^jtC^ 

M>^-7x-7 1 4 6 Oli/U 1 4 5 2#£tf 1 4 

[0 4 6 0] Hftx-^n-fey-tM 4 6 2H A^-f 
>?-7x-^ 1 4 6 OfcEMLttLTfc&ofcx-^ 

*1 46 2B t ffife<Dffimyy2?-efrt>tlZ>2'0<D : r 
—?*7i?x.2 h©H©*jtfS, 2~D<D^-?*7z?3:f7 
^©ffe&, feWt©tl*ft2b6r«l*«W, 2^©x 
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\Z. fLTf-^t^xi? h©7>^7D-£&3§ 

AX 1 4 7 1 <Dfil»m^tt, S9K!©£ffi 

[0 4 6 1 ] ROM 1 4 7 5H 8. 8 7^-77ht 
«0»T6nt2 5 5/xO*|»BcSWr**«. 
T% xtt0*^2 5 5iTOftt*5, ROM147 5 
tt, MXl 4 7 6*»SLT. AM>^-7x-Xl 
4 6 0*«krWB»X"-^^D-fery*l 4 6 2fcS!«$tl 
ROM 1 4 7 5ttfiVifiS©^Ix>K**^b, x 
-^*^x^ hiC 2 5 5 ^©*£S£ffi©x- 

[0 4 6 2] 0Jjt«l 4 5 2©&te2 

[0 4 6 3] HI 3 014, AAO?-7x-X 1 4 6 
0*£DSMffl£5St\ AM>?-7x-X146 0 
tt, t-^*^i* K>^-7i-X»l 4 8 0, 
*^7>M>^-7x-X»l 4 8 2£i;H;i 4 8 
4, 7Kl/X^*tt««1 4 8 6, ^>K£riaKWB 
1 4 8 8, fr^VKff KM 1 4 9 0, fttmfttt&l 4 9 
4, 5 0 0, SM&1 4 9 6, 

* 1 4 9 8, attfK, x-**HEn$?y^ 1 5 0 5M 

[0 4 6 4] x-^^^x^ h-T>^-7x-XSBl 
4 8 0£, *^5>H-f >^-7x-xaBl 4 8 2X^ 
1 4 8 4 £HU Wtofc*>T--9*7 9 Jx.i? VBtXttU 
> F <>*-7x--X«l 4 8 2, 1 4 8 
4lt 2^£fcfWffll/ 1 U 15 1 5*S©*Mff#fc.ko 
Ti/tStl*. -f>*-7i-Xil 4 8 2, 1 48 4 
tt> S^Stofc«*t)©x-^^^xi7h/^7> 

*IEf-*I/^^)WJ»T**t«ttVAL I Dfl» 
*a*-r*. >f>^-7x-Xffil 4 8 2, 1484© 
T-fUVXfOttMtT-irrtX 1 5 0 5fcg$23tl 
§o >f>^-7x-XS5l 4 8 2, 1 484©VALI 

D{t*tfc7n-/SXl 5 l 0K&$$n£o *^?>H 

^-7x-^«1 4 8 2*J:tfl 48 4l4, g«SB 1 4 
9 6*6©7Fl/Xt» fT^J*ffttSiSl 4 9 0^, x 
-**7^x£ h-f >^-7x-XSK 1 4 8 0 ©x-^ 
W?X*<Dfflai*aB-»D> *©+T&B&7l 4 l'* 

ft*W/xx l 5 i 5*6o«wi§i:«i;TW{t-s. 

£&B**fcU4§3\ *^>N>*-7i-X«14 
^2^i^l4 8 4©x-^U^X^H 7-9*79 
±£ U ^-7x-XSl 1 4 8 0 £fctt*fttt 1 4 9 



lit 
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6 0?-? I,*?*? Otiltlfr £7*-* &»iWr-5 J: 0 \Z 
[0 4 6 5] 7Fl/^ftJ*tt«»1 4 8 6tt, 77-f> 

wmmw>ft&£tf%&*®.nmmz%^T&'mi 4 
9 6*$d®b. v-7.ii&©77t:7.$n5^#;fc©ffi 

[0 4 6 6] 7 F V7>$Lfim.mWi 1 4 8 611 U'&^X 
1 5 1 5©STARTS^I5:^Sn?.Ct^. $fl 
flAXl 5 1 SOSTARTft^j&t&fc^ftat. 7 F 
UXfefrKigSg 1 4 8 6t4x-^*^xi7 F-f 
7x-Xffl$l 4 8 0 lC#LTSTALL<t#£fi¥"£U 

7 F1/X£j»*ft8 1 48 6(1 7 F kX&Sttilgg 1 
4 8 6*t7x>y7'-T5£<h*&g<ht-3£-*;UT*X7 

7>F-f >*--7x-X8Bl 4 8 2*5«ktfl 4 8 407 
-irVi?** ttoWXi? l 4 9 8&©-f*-7*;HI* 
KfcS. x-^tyyi^ h-f>^-7x-7vffll 4 8 
0*»6VAL ID«»WB»3ft*i:, 7K^t*K 
&m 4 8 6te# ttS-fc: - . left D. x- 
9*-7¥x.{? h<Z)%.<D$lftWmi3.Z>U>>Xi?\Z7 

n*. 

[0 4 6 7] 7Fl'X£j£tt 
ggggl 4 8 6«^7>K-T>^-7x-Xffil 4 8 4 
6 -f >x -V 7 X f— 7VMI t \L 7 t )V t £ 7 x y ? b 
SS»J:i:^-'<7>h , -f >^-7x-XSBl 4 8 2 
f S. fc43, 7 Ht/;*£jftRttll 148 6I1 fr©$c<fc 

5«cD«:i: zwenm-o 2 o©# 7 * n- Hf*. 

-7X-X8C1 4 8 2&if*^©STALL«-tfc«k-p 
*a*-r5. *U*C„ gflSSB 1 4 9 6 14, 7i7fSn 

z^z&a&mzsm-rz. M#©#*>*tf#K>rr 

#*>*»4fT£?0©»£m*n-FU #«fl51 

496 \tx<Dfimv&±ia&m-rz5\zmitL2tiz. 

[0 4 6 8] K^-fe;P©«©ffi*ft)t-r-5fc6»C»WA* 
ffifens*§£, 7 F WXfliricttffiSg 1 4 8 6 »4 2 

n?»m fx*J;tf*l©gt*»4>3ii-5. 

T. aoftlHM' >?y^f- 7>7Fl'Xrt*:?M5> 
F-f>*-:7x-*Wl 4 8 2C»&*lfcfc#. ttSBS 

ttftftflreeaiu Mte? h*y«fe» F-rs. 

tO 4 6 9] 7U>F£ffctfcS88§14 8 8fcJU ft**! 
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[0 4 7 0] ^>F£j*tt«Sl 4 8 8»42S&*>© 
(^^>7* ; E-FXt4XT-7 7' ; E-K) rSEfff 
^£-e**jfc*ife«>£. 7>>Ffi£rf>"2 5 6WTTS 
S»£ttS>*>7"*-Fa<ffi)bft, *0-Ztm®£\Z7, 

[0 4 7 1] 7*k> h*£j$Rtt!g 1 4 8 8 14. TIS©H" 
if SfrW ■£©£** US^X^ (r e gO, regl, 
reg2) fc-fey FT*. 7*5 > F 7 >7°^ ; f »fc5£2 
10 nfcfiSTXf-^t- FK& •"■>«•&. 5 11-fi$* 
regO (2 4tf-yF) 512-2*I$$reg 
1 (2 4h'<yh) t. -tLT. &7-H#l£ r e g 2 

(4X9fcT-yF) K, *tlJctl7vrt%. 7>-?W 
•^>7* ; &-F»t*S«^l4. OfcregO (2 4fc*y 
F) It, 2 5 5/ (fiS- 1) Sregl (2 4 tfy 
F) It, *l/t, &7-0M6£r eg2 (4X9tf>y 

[0 4 7 2] 7>7 L -v7"^-h*t*3V^T, «T©»"1*»# 

20 [0 4 7 3] regO>0Ta&5iJ, regOCre 
g 1 £iO;t, *<Dm%Z regO Icteiffrrs. t>3-0 

©< >7 u * *-7>$n-5 c <h *>-e#-5a<» 

^•©*l#lC»4ai**Mfc*^tiJ)DSn-5. regO^OT 
r e g 0IC5 1 0 SiO*., ■¥•©*§** r e g 

oitttjft-r*. -f>i"jy>^ttfijpsnftv». -f>7 

•J^>^©ffl7J(47>7 p fil'r»S. 
[0 4 7 43 v'^>7° ; E-Hlt*V>T, £*T©$&S## 

u--r 7 ;nc*^THfr$ n-s. 

[0 4 7 5] reg0l:regUM5. Un»©W7J 
30 14 2 4 If y hT?*0. 16. 8 ©ES^>»^7d—75; 

^»is*©s i i •staa-*«nn$ 

[0 4 7 6] -f >^u^>^©S»:f»©Tfi;8ify M4 
5>7"in!*5. ei©7>7"ffl, EP^reg2©ffi^ 
FW*&ffli»4W&x-*7'a-fey-!r 14 6 2 

i:si5.n> 7>7"*^-rs. 
[0477] ftmm#&® i49on &&nm&m 

40 f?5. ieftff?!If44X5J»C5cT?**. »l^SS4?5fc 
»4r-^*7*i?x;7 F©45 i v^;i'*a-JtS«t5»tfetJ 

TV»S. fil^AXl 5 1 5^e.©STARTffi#*tgS!l 
Stlfctt, ff?Jfll»ttS88U4ttT©«fc5fcl!i<. 
[0 4 7 8] 1) /XX 1 4 8 2R(^1 4 8 4&$%&ft 
55©#^»-|:7xy9 i -t^5-r>##*4jS'rs. 

&w?*fr i 4 9 8 *-r^-^;utT^sc*<te»>i 
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(D^\Z»n^tl^^h<D^m^~r^ "MAT S E 

[0 4 8 0] 3) f-?t^i?M>?-7i-^ 
SSI 4 8 OA^A^SWr-^^x? htffc^i: 

[0 4 8 1] ttfflttttS 1 4 9 4H x-^^x^ 

SB2 4 2tt/^ 1 4 5 lA^r-^t^x? h'J 10 

^UT, tgAMJ-A©2«* Xtt4«Ofi3T» 

*t*o#*fc», ttm«tt8gi 4 9 4a, xw^h 

5. ttfflttfiSi 4 9 4ttKT©J;5K:»frr*. 
[0 4 8 2] 1) I NT_SELm#**i*"f«^i:lC 
<fc^T, x-*E#n$^£ 1 5 0 3**A*x-^*^ 
^x^hSSEM'TSi'SCb, iEbV^-^^^x 20 

[0 4 8 3] 2) IB^TSx-^^xi? h*t©ffl£ 

[0 4 8 4] 3) f^t^x^h^f>^-7x-X 
ffil 4 8 0*«t)5'r-^*^x^ h*3WAttfcV>± 

SSAtt* W*AHJ-A#AaxHJ-AJ:D«V>* 
e»"C*^o STALL»#I47P-/U 1 5 1 

n*. 

[0 4 8 5] JMW1 4 9 6H, #*tWftfr5ft«6© 30 
EIS**fllbT*D» M*/tX 1 5 1 5©M*MH»iCJ: 

©^KJioTtttotlS. 
[0 4 8 6] 77>f >H««ift*J:««a*«WK:*Vi 

[0 4 8 7] 1) *©x*£tfy***f|'**r*. xj& 
**»JW«&a6fc, JHHK1 4 9 6ttftlftS*ffiVrr 
gift© x *«fc*¥* i^Sitx^^ © x jsfe^ftttl*.* 
*\ l«*tfliV»T»fiE©x*il*6*¥*J:r«ll[T 40 
^©x*»*3KJ:5K:-r*. y*S«HW*fc» 
fc, f?fi?8B 1 4 9 8 ttJni|[»ft«V*Ta«©y *«K* 
TXHtattT^CDyrt^ftlPA**, MgSg£fflV>T 
Sfi© y £ft* 6 *¥MIIf;^ © y Jfc5*£ 31 < £ 

[0 4 8 8] 2) ytfilft^^xy^XT— 
y HCiOgb-f^x^Ax- ^THl/XftfMW 
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[0 4 8 9] 3) xjSgH>T7^Xf-yjI/X>h 

UfcjnlJtu ^-fe;p©7Ki/A 

[0 4 9 0] 4) §£#»>>h*6 1 HK. 

[0 4 9 1] yix>Ht«*C*ViT, JM&«1 4 9 6tt 

[0 4 9 2] 1) AxyT^-FKlfc^T, 
5 >yjnffS£ffl^T7 >^4J«7;U^f U XAcoftSB^ 
ftftffMTr*. *©ffi©lo©Jn»8ll -f>* 

[0 4 9 3] 2) >?^>^ ; E-Htr45ViTtt, 
**Sft©5>^fflfclfl^«fc»tC10©Jn»«©**« 

[0 4 9 4] 3) >?^>^-HT?tt, #»©#J0J#T 

[0 4 9 5] 4) 5>:/£j*©ifi»fc*&oT, 77> 

[0 4 9 6] 5) fiS#£>h*SH&§|<. 

[0 4 9 7] ttl^A* 1 49 8H x-^^x^ 
K>*-7x-Xil 4 8 0,' ^7>hM 
>^-7x-XSBl 4 8 2&tfl 4 8 4K43^T, X" 

? 1 4 9 8H rt«***Wrt-r**» SftttT-^*^ 
i?x^ H>^7i-X8 1 4 8 0*6©il£©x- 

#i!T*£o I^X* 1 4 9 811 *fflJA*X 1 5 1 5 © 

[0 4 9 8] x-^HJHflU 5 0 0H W/U15 1 
5©M««*fcJ:oT*MfcS*l*. x-^HWBBl 5 0 

0 HU STALL«**x-*#:/5>x*h>r>*-7 
x-Affil 4 8 0, MtXlC, ^7>HO^-7x- 
XfiSl 4 8 24sJ;tf 1 4 8 4fcfttt-r*ZlfcKJ;oT, 

>*-7x-A«*l* «©>f >^-7x-AttJ* 

fi&©< >^-7 x-^flD4T*T-> £SttSt5 

©-r >^-7x-xaB*<?±$^« 

[0 4 9 9] x-^E#nS?y^ 1 5 0 5H 'ff«*# 
(ttttil 4 9 0*6©MAT_SEL«*t, ttWttffi 
SI 4 9 4*6© I NT_SEL«#t**OW«/tA 

1 5 1 5©ww«#fc«;i;T, f-^;u 1 5 1 o£<t 

^X^7 7^iH 4 7 2*&©x— **:/S?xir h 
fc/tX 1 5 3 0 fcttfcUTBEWr*. SE^Snfcx 
-^tt/tA 1461 'sffi^SftS. 
[0 5 0 0] 01 3 1H 012 9©p[»x-*;/D-fc: 
ytM 4 6 2*«tO»«HKiS"r. ■•x-^^O-fey* 

1 4 6 2H /Hy?-f>W»l 5 4 Ofc, 
y-^^)VzfU±y^l 5 4 5, 1 5 5 0, 1 5 5 

ti-byittl A^>^-7±Mi4 6o (feti3 
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1 4 7 O^^^M^ff^irJ:oT^$n^o ^TOA 
^-^^^^yn-fe^it^, HI 2 9C9l/>>X^77-r 
4 7 2S^R0M 1 4 7 5 7^(Z)A;£/£ *>AX 1 5 

y$nr/\*X 1 5 7 0 Hft^-^D-fey+M 

4 6 2£>£B?J 1 4 5 5 SrJgjSfS. 

[0 5 0 1 ] /U^yOUmmi 5 4 Ott, £TO# 10 

^^y^ymmmi 5 4 ootcn i^x^/vr:/? 
>r /wy^-f >©*iB&^*sa, AX 14 

0, 5 4 ozvvry^-r^*? 

;M^7<>«Mii/w 1 5 6 5*ev 

ALIDiWS&W. A<^5-f >*»»«1 5 4 0 20 

co/wy^<>XT--> ? ^n j ?nic43v^T, a^val 

ft*>5, yy^OttiA. W*>, VALIDflftt, % £<D 
/U7y-f>7sf— P\z&&. c<D£v\zLT, ;U7 

[0 5 0 2] tiy-^^^)V^U±y^l 5 4 5, 1 5 30 

5 0, 1 5 5 5> RZfil 5 6 OH, A^x-^^x 

S?x^h^S*4 0^^^;i/*JtoTV^fc«), #5 

[0 5 0 3] *5-f t*^nty»OtCtt, K£ 
t*OT»H (o p a c i t y) ^ir*/Hfe*Hrr** 

01 3 1 fctt«anTv»ftv»if«, mw/xxi 

4 7 lKiKttStlTV^»tDOHB^ft9. tiy-?* 40 
* a - t y U-tt^»!!^ v **ft iE b < fflsr « J: 5 

1 4 7 i^eomft^esa-r*. zti 

[0 5 0 4] HI 3 211. *9-***/k/n-fe5r»i 
54 5. 1 5 5 0, 1 5 5 5> i 6 -6 0ft (to 1 3 2k 
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Oy{7B 1 6 1 5 t, fcf^inWSl 6 2 0 t, 
D»T*16 2 5fc, ?7>ytfctt5^-l 6 3 0 
6 3 5 ££ffl*TV>£. Uy-T* 
6 0 0H ftWUf^l/^^ 14 7 0 
Ji^OJIBIt^S/U 1 6 0 2 *84 LT, /H^^ 
>*JflM8l 5 4 0*^©-f 16 0 4 

7, 1 6 0 5 £g£bT, -?-<0ffi# y-T**)V~7u*. y 
Vft*><0T-P*'7i?x.9 h*/tx 1 6 0 3 ^Sfib 
X, A*<>?-7x-Xl 4 6 0i^©f-*t^ 

x£ 16 0 1 IT, -tti-ensw-Es. 

[0 5 0 5] fflS^Dy^Al 6 1 AX 16 0 1 
SfrK ^Wl-th^^nfcx-^^y^x^hftAX 

1 6 1 ifctb*rr*. ai^py^Ai 6 1 0**0*40. 

[0 5 0 6] £-J*K43V>T, ffli?P7^Al 6 1 0« 
x-^^^xi7 h/\*X 14 5 1^^0f-^t^x 

fil£<DfSJ£01 2 9 0A*0^-7x-7 1 4 6 0^ 
&©*W7r^^fcJ:oTliraU H12 9©*^7> 
KAX1 4 5 2a*e^*^5>l**;/l/**r£*>£fc 

[0 5 0 7] — iRfiemj6*lC*5V^T, ffia^n-z^A 
16 10H 012 9 COAX 14 5 l*5®2:3©a* 

[0 5 0 8] 77^ >B»««*i:(tf«a*SWfc*Vi 
T, fil^DyjrAl 6 1 0 ttV»X tf^irJPOfil^ 

[0509] ttHfflm*afc*vvr, grar/n? ?a 
[osio] *wtm&s**mifc*v>T* jsa^ny 

?A161 0fi29©T-^*^x^ ^B4i«t 

[0 5 11] lx^a7;PV-^>fc*V^T, fil^P 
6 1 0(i2'3tiE)7 f -^^x^hftjkfft a 

[0 5 12] fti^jrA16 1 0t*&k<D9m&7 
D*;^ 1 6 4 0t, «i^Py?AWIr-D^^16 
4 5 ££ffi;t£o ^MR/D? 1 6 4 OtttflftflHIC 
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[0 5 15] 2O<0x-**:/yxi7h©ra£*£HIH 

[0 5 16] fifc^aWftS^l/RJM-S. 

[0 5 17] 20(Df-^t^x^ h*»tt, ^COffl 
\zm 3 0r-^^?i f h *»Jt«. 
[0 5 18] 2^f-^*^ii7 MC»bJHl«»* 

[0 5 19] &mm7ayS7 1 6 4 0©l^A*t4, H 

1 3 loju^y^ymm^i s 4 on«t^T4j«sn 

3, /U 1 6 0 4^^<^»^W|§llJ:^T'1' ^- 10 
^l/£ft£*xXX-:/;U£n£o jaSI^Dv^A^Jl/ 
-Di^y^ 1 6 4 5t4AA 1 6 0 
x^7 h£<i;tfAX 1 6 0 3*6£>x— ^^x^ h 
<h, Vi<^0*«|fiE^Dy^ 1 6 4 0©ffl*i:*a» 

flio, cft&*-t©«<o«ian«:*«iB^Dy^ 1 6 

4O0A*K»*. iiyD7?A^-D^^16 
4 5 fc/XA 1 6 0 2*6©*HMB§l:J:oTi*3ft 

[0 5 2 0] fiiyn-y^B 1 6 1 5*4, AX 16 0 1 
*5©t-?*^iJ h£AX 1 6 1 1 20 

6 1 6i^a^-r^o ai^D^jB 1 6 1 stmwm 

[0 5 2 1] #IE©:t^l/-^£t>0^£K:fcV>T, ffl 
myUyZB 1 6 1 Sttx-^^^x^ hAX 1 4 5 

lss^so^naasnfex-^^^x^ hi, 

>hV\*Xl 45 2*6©*<7>FlC#LT, AX 16 

0 3*6©^J««l«*»l**tt , bfc, 8. 87*- 

h©2 5 5/^a««©«T*SROM©HlAft, 30 

[0 5 2 2] lEO^I^-^ftfe^jfcfcfcVVT, fflS 

^D>^Bi6i5ii ywasanfc^o©^-** 

14> tBE©»*62 5 5*5lV^T, *©J6«*7-fey h 
2 5 5Tffl$«» 
[0 5 2 3] HRfl£IIME*fc*V>T* jm?Uy9B 

1 6 1 514, /U145 l*S©2O00*«ftJBl>T 

iftflW»I^P5>?Al 6 1 0*6©«»W»C*IB 40 

[0 5 2 4] 77>f >M^££tfft&4tiffiK:*V> 

T, jBM7uyi?B 16 1511 Sfty 

/but, ffi^wic^w$nfctfi7ir;ucora^«raL, *§ 

to 5 2 5] iB»fifi!lB**fc*V»T, ttMfny&b 



7Uy2t, 9H>mzfUy#Btf)\'-Uz?yt7 1 6 5 0 t 

£<Bx.6„ *«ME:/ny£»4, fil^Dy^Al 6 1 0 

1 6 5 0 K&V^Tte, /U 1 6 0 1, 1 6 0 3, 1 6 1 

1, 1 6 3 h<h, m&zntz 

*«ffi7ny£©ffl*<fc££ttAft, dtt6*8«Sn 

P^7^ 1 6 5 0 1 6 0 2*6<Z>fH»fi#(;:<J;o 

[0 5 2 7] tf -/ ^JbP»gS 1 6 2 Oft, ffliya-^A 
16 10 tKMJUyZB 1 6 1 5*6©<B#Wi&*© 

iK^^te&ts. £*U4, /ui6oi^ifibx 

AtM>^-7x-X 1 6 4 0*6, AX 1 6 1 1 
fSLTJH^Dy^Al 6 1 0*6, /Ul 6 1 
ALrmm^Uyi/B 1 6 1 5*6, -tUT> AX 1 6 

0 5&m&LTl>PZ*7y'()H 4 7 2*6, -ttt-f 
tlOA*SaW*D, AX 16 2 1 fc«^Snfc»** 

fc^yjnffsi 6 2 0 t>, ;U 1 6 0 2©« 

[0 5 2 8] AUDITS 1 6 2 0*4, ^^^B«l®31 

HONS 1 6 2 0 09%OB«ffl3MfP^«H-«Mf^«H 

[0 5 2 9] ^iECD^lz-^S^O^l^ViT, t: 
y if 1 6 2 Ottffii^n^^B 1 6 1 5*6© 2 

[0 5 3 0] jE©*^Ix-**J*:>&/«fc43^T, *7 
[0 5 3 1] 77>f >B«**/«a*SWfc*ViT, 

tf?Mn*»i 6 2 o\$mm7ay2B 1 6 1 5*6© 
[0532] fttt&£mxae£i>T* miv-fzjw 

[0 5 3 3] #&#JD&T SB 1 6 2 5*4, AX 

1 6 2 l£SSbTh>^;(jng&l 6 2 0*6©A*fe 

yhfcfttt, I/^X^77-fJH 4 7 2*6AA1 6 0 
5©BPffl^fcJ:oT**Sn*. BPfS-ft£S?#5T5 
tt*ftttT©*fc3)i-r, ^0^T6nfct«^(4AAl 6 

2 6fc«tt3n*. 

[0 5 3 4] »9ff—7J\, 
t0 5 3 5] 

&2 7l ; 1 : -M V§§ 



If If IB 




( 86 ) 



11-85969 



169 

& 27: 77^yg > : r—~7)V 
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bp ^-f-JPF 




0 


Bottom 26 bits are fractions. 


1 


Bottom 24 bits arc fractions. 


2 


Bottom 22 bits are fractions. 


3 


Bottom 20 bits are fractions. 


4 


Bottom 18 bits are fractions. 


5 


Bottom 16 bits are fractions. 


6 


Bottom 14 bits are fractions. 


7 


Bottom 12 bits are fractions. 



[0 5 3 6] #»WD»Tffll 6 2 5H #&©9J9j£ 
T©^K2O0fMI|*fr3. 
[0 5 3 7] 1) «0»T6ftfc»*A«fi"C*-&*i!5 

[0 5 3 8] 2) «0»T6nfce*©»#tta*2 5 5 

[0 5 3 9] 2y>7Xteyy^~-l 6 3 0 »4AA 1 6 
2 6*ft*LT»ft«0»T«l 6 2 5jfc&A**3W" 

[0 5 4 0] «D»T6nft:ll*fi!)«l*tt**»«^* 
[0 5 4 1] 7-^*^x9 h©7>y7P-S»§ 

z&zmxmiz, ^n^nzy 

[0 5 4 2] Uij^MUMl 6 3 5(4, /U16 16© 
ffii^DyJBOBIAWXl 6 3 \<D$y>-?&1t\* 

[0 5 4 3] #jE©*^l^-^*t>0 % 
£j*fc45WC, ^SftfflSl 6 3 5l4«a^D^irB 1 6 

[0 5 4 4] *IE<!>*tl/-^*"bO, 7HR**D© 
££fc;&V>T, ^Sfbffil 6 3 5B£9>:/*fcl49y 

A-16 3 o©m**aja$-ri-^ 0 

[0 5 4 5] IE(D^^-^*t)O^J«I^ViT, £M 
YbBKl 6 3 SttflSynyjrB 1 6 3 OOVK^OW 

[0 5 4 6] HBtefflira»K*V>T, ^SffcSB 1 6 3 



10 



20 



30 



40 



[0 5 4 7] faommz^X, ^gffcSB 1 6 3 5 tt, 



7> KiftSffll 6 6 ORtfl 6 7 0 t. 30<9^Ii{fc;I& 
SSS1 6 8 0, 1 6 8 5, ^1 6 9 0 ^ 2AJlM& 
ffll 1 6 7 5 <h> 2 ^<2jjn&£&0 2 A**K«5 1695 
l/^X^ 1 7 0 5 i^lK.^ 
[0 5 4 9] FfcfflW 1 7 1 0 tt, H 1 2 9 <OOT 
\mVi?7,$ 1 4 7 0^6^)MODEfI§l 7 11^ 
01 2 9C0A^7<>^-7x-X 1 4 6 0*5020^ 
SUBff^l 7 1 2£tfSWAPffl*|l 7 1 3»t*SW 

KttaifiB 1 7 1 o a, ^nbcom^-^m^L 

T, jtaff*^5>FiMM8l 6 6 0£<fctf 1 6 7 0(1:, 
^S{fcffeSSB 1 6 8 0, 1 6 8 5, 1 6 9 0 \Z& 

tt, **«yny^ i 6 4 o*a/fa»fP©-r**«t5 

\zm$L?Z>o *«Mgyn '^ 1 6 4 0H 8 ^©^- F 

[0 5 5 0] 1) iJOjjW^E-F : SUBffl*f 1 7 1 2fc 
«fW A^l 6 5 5 £A# 1 6 6 5 
14, A*l 6 6 5*>63K. Sfc, SWAPfI^6 9 3 

KfiEK AAS^yy^-rs^tfeTSrs. 

[0 5 5 1] 2) /H/Ut-I^ : A* 1 6 5 5 %mtl 

[0 5 5 2] 3)iPlt-K:A* 1 6 7 5»7r 
^<hbT, A# 1 6 5 5 t 1 6 6 5©H**im-r5. 
SWAPff^l 7 1 3fcfi£W A* 1 6 5 5&i;tfl 6 

[0 5 5 3] 4) ^l/*»€-H:A* 1 6 5 5l:A* 
1 6 7 5£»tt, *©»**2 5 INCW> 
X^ 1 7 0 8©BAH iEUMB**#$&»fc/lA 1 

[0 5 5 4] 5) *JPE-F:A* 1 6 5 5teA*16 
7 5 «»U 

[0 5 5 5] 6) ftl^*J;tf7l/iltJpE--F : A* 1 
6 6 5 £A* 1 6 5 5 fclD**** A* 1 6 5 

5*-6§|«, *©IS*fcAa 1 6 7 5 *UT, 
£©S£2 5 5TgH£o INCl/^1 7 0 8©|ilA 
HU jELl^SRW&fca&te/*;* 1 7 0 7fc**Zlfi!>A 

[0 5 5 6] a0g*^>F&S«J16 6©&tfl 6 7 

fKS*ci6;i;xA*^^i©»Sc^»§o lMli 

6 7 5H XX 1 6 6 2 1 1 6 7 2©lD**^5>Fn 
?y* 1 6 6 0»tfl 6 7 0©ffl*£-&gU *-©*H£ 
/U1 6 7 7 fcffl*.-r*. 
t0 5 5 7] &maiyy9 1 6 8 0, 1 6 8 5, Xtf 
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1 6 8 2*^0>A#**AX 1 6 7 7A^OA*l:»H- 
3. -tUT, iftfS«KAA 1 6 8 7&£Zfl 6 9 2)d^ 

[0 5 5 9] JnJMgl 7 0 0U, SITSl 6 9 5<Dft*f 
£>T{£8 t:y KcSKffll 6 9 5 coai^c0±f5 8 \iy h 

zijutlz. jpssnoooffiiffaiNc^^^i 
7 0n:7y«ns. iNci/^x^nont s 

*§1 7 0 2fc<fcoT<*-;/;^ft£. 1/^X^17 0 
5tt»S8Bl 6 9 5*^©»&E«T-&. ^nfeff^l 
7 0 2(r«toT-f *-^JP$tlS. 
[0 5 6 0] H 1 3 4tt, ^ftfPO^Dy^H** 



[0 5 6 1] 1) S^tf^irJUx-^ : £03WMB*x 

[0 5 6 2] 2) ^**'<^>K:A5-tTaW** 

[0 5 6 3] 3) «*:*^>Kx-^£^Stf£- 

[0 5 6 4] t!^-fe;i/x-^»4ASWtr4 0©5 1 ^*^ 



( 87 ) 4#HB¥l 1 - 8 5 9 6 9 

172 

e^-fe;w5 f -^»4yi/ai»snT*>, smsnfr<Tfc 

[0 5 6 5] »aft*lfi«T*fTSft-6^«*^**l 

(acO, aO) t4yi/»»*nfctr^-fe;^7-ac 
WliSaO^ rtt B t7t5/r M, wc (M4 
yy-?/2y>7- *"<l/-*S«#U * 1 KUJtt* 
over, in, out. ato p^&^^l/— ^CDjS 

[0 5 6 6] HI 3 4l:*»§^Dy^ 17 6 0 
»4, 3^©*7-WDy^tTS»WDy?* 
JWTf*. ft*©*?— *77p?int* A^lf^-feJU 

[0 5 6 7] 



P I XEL Compo s i t e ( 

IN color A, colorB: PIXEL; 

IN opacity A, opacityB: PIXEL; 

I N c omp__o p ; COMPOS I T E O P E R ATO R) 

( 

P I XEL result; 

IF c omp„o pj&*r o v e r, r i n, rout, r atopTj&St 
THEN 

color Ate o 1 o r B£X7y 7T£ ; 
opacity A, opacit yB$X7^^t§ ; 
END I F ; 

IF comp^op^over, rover, loado, Xt4> plus 
TSStTHEN 
X=l ; 

ELSE IF c omp o p^in, r in, atop, Xt4, r a t o 

ptWi:THEN 

X= o p a c i t y B ; 

ELSE IF comp^op^out, rout, X14* xorT$>£<!: 
THEN 

X=not (opacityB) ; 
ELSE IF c omp„o p#M o a d z e r o, 1 o a d c , Xf4, 1 
o adc oTW^THEN 

X=0 ; 
E^b i, t £ i 




.1 fe 
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ELSE IF c omp op^plus, loadc, Xti, loadc 

oT&£,hTHEN 

Y = n ot (opacitya) ; 

ELSE IF c omp op^p 1 us, loadc, loadc 

o7?&£ th TH E N 

Y = 1 

ELSE IF c omp o p^i n, r i n, out, rout, loa 

dzero, Xlt* loado THEN 
Y=0 ; 
END IF; 

resul t — coloA*X+colorB*Y; 
RETURN result; 
Ait' load' t 9 loado' WSSft>W 

[0 5 6 8] HI 3 4\Z&VtZ>?Uyi? 1 7 6 5 tt* 7 
Uyi7 1 7 6 0<DM%&#7>7£it\i&7 7 

^D*;^ 1 7 6 5**X7^y-r§<t5K«|j*StlSt, 

( (x— min) mod (max— mi n) ) +m i 
n, 

/httta^ttfeatfle-rs, ltb* o 

t2 5 5A*S*bK 

[0 5 6 9] HI 3 4fc*W-&:/n^ 1 7 7 0*4, ^ 
Dy^ 1 7 6 5*60i*t^l/*»t*. ctia^u 

ft**?"*. 2 5 5/oC9ffifctg7«X>^>ft<2ROM 
*6#6tl«. ROMrt©«48. 8 7*- VyhTgE 

Void line draw (length: INTERGER) 
{ 

d=5 1 1 - length; 
i n c r E = 5 1 0 ; 

inc rNE=5 12-2* 1 ength ; 
r amp - 0 ; 

for (i=0; i (length;! ++) 

{ 

if d (=0 then 
d -f = i n c r E ; 

e 1 s e { . ; , :; 



[0 5 7 0] ^5>H*J««1 7 2 nmi£<DMi&mt 

[0 5 7 1] 1) 7>7&f& 

2) mm 

20 t% oa>S 2 5 5 *^«»i«onr*ftj«*£/ST*. 5 

tSSA*2 5 5 J:0gV> "TsrVf" *-H©-2^» 
5. *-Htt*3<B±ffi2 4Hy MtioTft**. S> 

ifc im Xf^^-F^V^ 

[0 5 7 2] FlC^T, -&*£x>i?>te 

Xf7/fi2 5 5/ *#»£fc»l;:8. 8 

7*-T*; hOROMtfflVi*. 1 6 If ? hSR 

30 JMBfcJofc&ft*. SRJMOWAttSliy hTiflO&T 

>S»»4B r e s e n h am©8I7^'JXAWc 

f. 

[0 5 7 3] 
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} 



[0574] Blend= ( (end-start) x 
ramp/25 5) + s t a r t 2 5 5 \Z£Z>%m\Z 
ttLT^OitTtffrtens. ±fBi£(4. 2 ^©jJDfffgs 

S-^-v- >^;PC07 >7 f lcJ:oT (e n d - s tar 
tlC*ft) "7*HftJT SfffcSyPy^tt&Bfc-f 
•5. 10 
[0 5 7 5] ±x-^A7.gB2 4 2*tfr&5di:©-e# 

IBIX& (GCSC) ttW**5-«t*»*&»fcK- 

*7c©A*^ra* e. i ^Tcfc l < a 4 j*c7cm*^rax* 

[0 5 7 6] U<^©«^-C*V»Ttt, fi«©Xy>? 

fcV>TifU<&-5. ^©^B&fflft-5fcfelCGCSC 20 
Ht, WJ»fcltatb*fiSI»HIK*V»TIH|CSn«C4:*» 

[0 5 7 7] 

0 If x^63 

out = 2 (x-64) 1 f (64^x^191) 

255 1 f (1 92gx) 

;k (arte. x+a-$ns. e&#«gfc 30 

>5< >S4^rSfcJe>Ctt^©S|liA*^T 



[0 5 7 8] 1) HI 3 5l3CTJ:5&BflUMM>;i* 

vegxa-rs. ^n»:ioTBWH»©x* 

* >9-f >££l£?-5fc!&gfcy-XH«UDfc!^1:;i'* 

[0 5 7 9] 2) y-XM0«SflE#ftftHlrt'«. 

[0 5 8 0] 3) BWHft©*^ att9"f^>f l )> 40 

[0 5 8 1] 4) ±EflHlftaa«lc££U. &gfc-y- 
[0 5 8 2] -y-^>yu>y, *tffi> Bftt^-fcJHD 



&gft&Pg©7*ny#iaT&-5o 01 3 6H&gfcy- 

x H&© tr £ t ;i/*^iJlH nl^T & -& t> © b r n 

•So 

[0584] awe^-fe^sthff-rsaaciMHi. y 

[0 5 8 5] ±T-^/t^flB2 4 2 icistt -53^^88^ 
IC«tcT§lttB$n^li^^X>v'>©^D-y^0$; 
0 .1 3 7 C^-f. ®ftg&X>i?> 1 8 3 0 147 K 
£jj£ffl! 18 3 1, 7>*JMS 1 8 3 2, 1 8 3 

3, 8ggB1 8 3 4. «»T, Jfttt«t&tt 
-51138151 8 3 5;fr>6,ft-5. 

[0 5 8 6] jfh'kXif&gBl 8 3 111 t£M\Z!7±.)V 
*^-rs©C£-Hfey-XH^©x, y«|££j£-r 
S. £tt»4A?K ^y^X^— 1 8 1 5 <h 

Btl8 1 0©tf£-fe;i/#>£-f >x->^X*7-fe-> h£ 
$tsbZ>tztb<D7 H kX*4fi£-T-5. 7 F kX£f£8B 1 8 
3 1A«y-XBI©x, ytt*^-T-5MCA-^;Ux 

V y h Ktt 2 ^©ffl«*t* 0 . 1 3 8 \Z^t. 

1) y-XMfcORttft* <«F*fcL©B*/Mfc*. 2 

4. 2 4 fitg) .&■((>. 0) HJMt©fe±(re* 
*. 

[0 5 8 7] 2) m&<DV7V>-f)VT)Vir (2 

©MR. 2 4. 2 4HO) 

3) ftt«4kiffiWMz*ft*2^W&<D^tt 
■f 3t!y h©bp7-<-^r ! . HI 5 0»4b p7-f-JW 
F©Jfe8£*-©iK9I£^f. 

[0 5 8 8] 4) XJFfraifMfc. Ctttt2Ofl0 2Stt 
@ (2©fflfc) »J£" ^MbMAftottCDTft 

0, 2ii/hSbS©{ar®f4bp7w-;PH»cJ;t)BiP^WIc 

[0 58 9] 5) A-^n^r-fX^U 7*^07- HO« 

[0 5 9 0] SV^-^X-fX^U^JCfeUT, x 
©H*&&£©5fcR$£l&V>fcflfi©/l 7 * -5 «&© J: 5 

[0 5 9 1] x©MttftB©£Mfc<-0. 

y©M*fi*e<-0. 
^fJl'K-i. o, 
itf^K-i. 0. 

7FkX£j£$l 8 3 lftMftlftStifctft. SMSHfittff 
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[0 5 9 2] y7V>7)\sft¥\CD%iK>t}nx lTU^m 

-&> 7Ki/xfe«3Ki 8 3 i\mF\<Di-o<Dmmz>$. 
m&&izmx.z>. THixx^gui 8 3 ltt^jsa* 

;p^*3K. *©ffe, 7 FUXfo&ffil 8 3 lUIilf 
>3*U £0;&i£l;:J;oTSft©x, y*b#^££tt£ 

[0 5 9 3] 8«B*t*b&«, 7Fl/^*rt«18 

3 KI'X 

«1 8 3 l^OOfy^Xf-^W^W 
*«) c -f ^y^Xx-^l^-X 7^X14 (y + 
0) fc*»t«< >7*;Wf- 7;Ux>HJftSf. -f 

7Fl^X£ric*l 8 3 ltt*n*xJS«(CjD7e 

7HK*£j««l 8 3 lttx£«**©>f >xy#X:*- 
[0 5 9 4] ft&^l? 

Ait. ^^mrAoMjjvztMz&fzftmvm 

[0 5 9 5] HI 3 9fc*5V>T, +ft©^7^9Att 
[0 5 9 6] ^HMMSl 8 3 2tt**T**l«K*'t 

[0597] mmmi s 3 2\mm&\mw<z>Mv)& 

y-xH»*ty©*©j«©2 \z>7±)v&m 
fti;<xo»Rft»v»TiiBr*. 



178 



20 



[0 6 0 0] 2) £T©frMK#1-*±<OlB»**»U 

[o 6 0 i] »sc«i 8 3 4©»»»i, 

[0602] 1 8 3 5 1 8 3 4 <D\tit) 

-r. £n«, hi 3 9ic*5WSTSHo^-f 7y^At* 

[0 6 0 3] ±T r -^/^XgP2 4 2**ff*.5*>5 lr?© 

^n^\ (3jfc«»*iiBK*^<) Httfian** 

[0 6 0 4] fr?B5RJ([Olt*tt*©iC^±t>TS«Sn 

[0 6 0 5] 
*7] 



*»3 



255 




30 [0 6 0 6] ICT, r i ttfeShf^-kJl'T* 0. ai 
ttA*^9>HKi'*feJl'r*». fr^J©-»}-'fXf45?54 

[0 6 0 7] B14 0HU &T-i?K7M2 4 2fc*5V> 

[0 6 0 8] fr*Jltfl^7r«fe*Ctt2*n? 

40 U f£»ffi£i0£$©x-*#iEb<SiR;£n££3fc 

[0 6 0 9] «o-y--<^;ncteViT, tf£-feJU©gTfS: 
2/U Vifi&HikM 1 8 6 1. 1 8 5 2 CfcoTgftS 
ti«. #lc^©&&£fr^©£tli;:*5tt3 2:3©yij, IP 
■fe, *lry5'jLK*»t*»0 9-f>lC**ffJII«RlC» 

»*. . 

to 6 1 6j ^jW=^v»t, tf^-fe^oibl 




( 

179 

nmz&n&mts tr^ McwoteTens 1 8 5 

[0 6 11] ":j-^5>F»gar 1 8 5 611 JOggB 
1 8 5 4<0AW4Ot&5J:'5C**fiBfflASr»E^J 

[0 6 12] "ACttfflW" 1 8 5 5HJOff«<DiaAO 

\z. 2 5 5H±©^T©ntt2 5 BtcaiAsns. "A 

[0 6 13] if-^U«2 4 2lt ±CK^©H« 
[0 6 14] RlHfspJ«irJ:oT3^h^fi«3nst 

-T^- £©:i>fcr;i-*7-*x^*tt*tttt 

T» fcH«ffla«ifP* fc^ff "C* S J: 5 \Z 3 > 

[0 6 15] 3. 18. 6 x-^+vyi'iWIHWi: 

7-^*t^r>aa»8l52 4 0ll 3^Dt-;*2 2 4 
fc*»S4*D/WH©l»*aiLx--^*'1ry^a2 3 
0 *<BATVi«. v y ya2 3 0 f4^-f l/£ h 

ya-^t'J 2 3 0 (H2) fc*tf*HCftS©Hi;? 

[0 6 16] 5 s -^*^y^J.*Ifll«2 4 0142 -3©:* 
^5>H*-^-f1f 2 4 7, 2 4 8*^Of-?S* 
Stf- iff-mtr>a2 3 0Kflt 

^e>7x^^ns, f-?*tyyaanas2 4oi: 

«:/ny?A©T**7Fl'X^«#*D, x-^* 
^ySoJMM^ 4 0 3i«V><^©K&£7 \*lsy*s> 
Wmft?Z><D*:vim\Z?Z>. 5#£tl& 
t-^<^7 H Px*<f t V y y aW»S8 2 4 Ofci: 



91 ) #P8¥1 1 -8 5 9 6 9 

180 

[0 6 17] *^y>jLRAM»480©ttS[bT7HU' 
X>5ffi&**y/04^S»5 (S&£5-r>7FlxX 

*«2 5 6fcfy hK*ffiWt5n-6-»©«fj*7HU^> 
[0 6 18] *vy>aJ4, BffllClg&TSBtT©^-- 

10 FirfcHTftfTTS. &BT*nH t^to+tyy 

a a* a Ana sns & o cto ^ t 1> pjhets 

[0 6 19] 1. /-TJI^E-F 

2. ^-fflA-ttftawHeait-H 

4 . J P E GfSMtfb*- K 

5. fij£J PEGg^-H 

6. ff JOSMSC^- K 

7. rXX-^M-H 
20 8. JBftflS^E-F 

014 1(4, H2i:mf-^*v-; V-J.MTO2 4 
0©7FI^X* x-^ a«7n-tf-?ttr>^ 
2 3 0 t&3S"T. 

[0620] r-^tr>a2 3 0(4, w^i^y^ 

Mm$&2 4 0(4, #+t*;ya7^f>l;*W^yx> 

hutw-rs^^^u 1 8 7 2*a«ut*o, 
*^ ; EU7Hkx©«±ffi«*«rr*. £&, sseco* 

30 tyya7^^W»§^£5^^t7-<>ta 
OMW^EU 1 8 7 3 "MBA*. iT^tr/a7<> 

[0 6 2 1] r-m^>aWP«2 4 0ll, 
>H*-^<1fC2 4 7 (02) t*^<?>H*-» 
±-YtfC2 4 8 (H2) *605 ? -^B*ft*^5>H 

■Mfsfc*V>T, *^5>H*-3?^-f1f2 4 7, 2 4 8 
(02) ©2%6*©-*feU<tlPI*tt<>ry^X 
1 8 7 4 SJBftU x-^EsMHf 1 8 7 6 &arr- 7 
40 FkXfojaBl 8 8 l(4-T>xy^X 1 8 7 4 ICitbT 

i o *> u < n*na±©^an«7 h v a i 8 7 7 & 

FWX1 8 7 l<b9V7YV7s\Z$f$%>$>f*^) 1 8 

7 2ft«aE-r*tt*>fc, santt'r>a7-f>^ 
y 1 8 7 3*fcarr*£fcfc.ko, g#stt&x-**< 
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:Mif2 4 7, 2 4 SKiifcttS. BSSftfeT-^tf 

>?-7i-^ 18 7 1 tAM>^-7x-^<y 
^2 5 2 (02) SaUJT, 1 8 7 0 

*Wffl;**U*£:7xy?3n£. x-* 1 8 7 0145 
*ff^ 1 8 8 2 ttft-h U 1 8 7 0^ 

teftLZtltzTFUZ 1 8 7 7£Jg&-f 3£<hK^T7 
xyf^ftS. 7^7 7 U^ff^fl 8 8 3&tfB#Stt 
1 8 7 0 W-tn-fn+V y jlMWW l 8 7 8 

OD+t^^^^U 2 3 OJCHa-TS*^^ va5<> 
***fUV^-^ 1 8 7 0IC<fc^TH#f2n3o SrLV>* 
t^>a5<>©^y7HWfc^^ : E»J 1 8 7 2(1 

Ml 8 7 3j&Wtl^ 7^/U^ii^l 8 7 
9I4t-^1 8 7 OtifeKHa'r«*^7>H*-*f 
:Mtf2 4 7Xtt2 4 8 (H2) ICi££tt£. 
[0 6 2 2] 0142 fcj3V>T* y va 2 3 

-vyvjL7-f>:B#3 2T&3 1 2 8 5 
-<>C0, . . . , C127&h~D#'(V9Wy-f* 
*f>j.£UT*33tl£. **y$/aRAM»4ffi*© 
7FIxAJt£©T€r***yM>£BO, . . . , B7 
£fttf§UT:fc9, #^ : EU/1>^H3 2My h<Drt>2 
>1 2 8fl©**#"S. ^^V^^y^yC i (48 
^O^U/O^BO, . . . B 7K43V>TffiM§?*£8 

-0<DK>9 7<1>B 0 i B7 i SftS. 

[0 6 2 3] ftrtStlfc^aB^tUTHl/XfiDliJESftH 
14 3fc5Rt". £f£2tl£7FW*t42 Olf-jy h^i/7 
Fl/X. 7t*y F5-r>7Ft^ 3H^h/Oi77H 

-FT**. 2 0fcyh*$f7Fl/Xli*y7Fl'X£ 

99**» 1 8 7 2tcBMtatiTv^^ytH«rr«© 

fc<fcbtt£. 7lfyh5<>7Fl/XH*^^^^* 
>J 1 8 7 OCftftHRrs**?^?^ ->CE)7H^ 

irttbft«. sty fa>£7Fwu4**^>*** 
u i 8 7 oom&tttVrtyzvTFuxiz&toti 

2ify h/WF7FV;M4 3 2 H^FA>^-f> 
©BttTfcW h©7 Fl^fcttfettS. 
[0 6 2 4] 014 4*4> 9 f -^*ir^->3.«W«2 4 
0 <hf^t^r>a2 3 OO$ffi©:/Dy^0£^ 

CilTf, 1 2 8X2 5 6 t9hRAMB*t*y*>a 
^ i EU 2 3 0*«J*U, E*U48O0 1 2 8X3 2tfy 

RAMte#£&*pJflBJp-F (write), 
7FWX#-h (writ e_add r) , mz&fr? 

Ur i t eja t a) fcfco. £fc> M 



182 



Sfc»*-Vy>J.W»> r ni^^ 1 8 7 8*^#$a*nl 

2 3 0 «»£&*t-^#- F (writ e_d a t 
a) ZmCT^mt^VftZO 1 t>b<H-ttl£Jl±©5 
<>©r-^l:ia$Wo #£&^7Fl^#-F 
(writ e_add r) \Zy-i>7 FUX£tl&U 
8 : l^S{kggMUX^?fjffiT^c:i:lwJ:oTl^-r> 
8 : l^fiftSMUXttx- 
10 9*r^yz/zL$mffi (addr_se 1 ec t) (Dffl'M 
OTTtricSnft^ST Fl/X^^-f >7FUX£il 
^"T^o *t y va^t'J 2 3 OOiWtU/O^ 

ynydr l 8 7 8*6«t*^I««^*«**sn*. -co 

^ffiT?, tty^^t'J 2 3 O0^^Vn>9<DS-D 
0»*7Fl/A#-h (read„ddr) fcftftsn 
Sft^r©5-f >7H^fc«i;T, 80©K*f-^# 
-F (read_ata) *>5 8 ^OS&*M>^ ^-f 

20 [0 6 2 5] ft*©**^^^ 2 3 0G>/1>£»4 
yuifyl^imT Fl'AtjfcSSl 8 8 l*»t?TV^-6. 
Ctl»4a5 8 0©ffiK^©, Hat"§8^C!)^^:U/t> 
^*^©HWf7^-bX*Blilfc-rs. ft*C7FWA£ 
d» 1 8 8 1 J47 F PX^riW 18 8 1 Offsft*- FR 

? AvimT F l^X*J5jai 18 8 1 ©fHft*- Ftt, 

(a) d c c FA*^Ofl§#ft*©7 H W4« 
S§1 8 8 l&yyy&TZtX^-FlZb, naiw^y 

30 7HV^<>5 S ^^X/Xiry hA*-N8H»atl, 

fe b < (4 j en«±c^7 F W^tofiS 1 8 8 1 ©7 F l/X 
fflafcfflaanfc^^i^-fc^t-F ; 

(b) dc c*-FA^©M^#£*©7Fl/A£^ 
« 1 8 8 1 FfcT« J P E GX>rj-x-f 
>^^«^ figmJHfe* frWSRS*- Fo £©*-F 

• Ttt, «-*©7Pl/X£j*Bl 8 8 l\Z\t<i>Ty9X 
n^rvhMi^<0^>^y97,ifiXi}^n, <1>tv9 
X7HWS&jct*. ffsft^-F'te-fcoT* 7FI/X 
£&SB*4&A 8 Aft**«;t % U 7 F V-A 
40 i*5Zl£#T£5o 

[0 6 2 6] 8"307Pl/XtJ*«l 8 8 lt48^©g 

7Fl"X, fflaiLTft***y7Pl/Aftl9r>dc c 

[0 6 2 7] ^-^Fl'Xl^A* 1 8 8 5fcM >x 
v97s!\*ry F©&£#T&£&fc<D^-A7 Fl/AS 
EfeU d<i e%-j*t/3>*^l fe 8 8)4^-:^*^^ 
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x 2 0 \dy h©v;^7R-hRAMT«j£Sn5. 
RAMHlt30»**-h (update-1 ine- 
addr) , 1 O^S^oJgg/^— Kwrite), 8 
-DOWtfrifi-h (tag0_data, . . . , tag 
7_data) *8oTV>5. iinte, 8^©7Kl/7 
4j*Sl 8 8 l**gifcfE1&3ftTV>5, l^fcb<«-C- 

HUX*ftS-rSd<tlCJ:0#- h (r e adO 1 in 
e-addr, . . . , read71 ine-add 
r) \Z#^T8^<Dm$<D))/y>7 7y-?$:vimz.TZ>* 10 
znt>y-0<DMl£<D?tf7 HU^ttTR-h (tagO 
-data, . . . , tag7-data) &t>#{f\k 
Ml 8 8 6Kffi^£tt£o jR-h (update-1 
ine-add r) ©i^^U 1 8 7 2^©»*&* 

[0 6 2 9] 1 2 8tfy h©5-f >v a 1 id^tUl 
8 7 3H ttty^a^t'J 2 3 0©#*ty^a7^f 
>©val i dttM*«t>Tl>§. £fttelo©§#/tf 
— h (update-1 ine-addr) > 1Z><DS 20 
#pJSg#-h (update) , 8 0©tt*a*>R-h 

(readOl ine-addr, read7 
1 ine-addr) > 8 ^©RfrnJflBTR- Klin 
evalidO, 1 ineval id7) *5 

ftSl 2 8X 1 \zy h©^^EU"C*5. ^^U^H 
L^K, ^n^8CKD7Kl/X^gUl 8 8 ltc, 1 

0y-f >7FUX\zttLT, Mft<D7-1 >\z±-7zn 
T^7^>val i dttJB**S*-fr«21fclcJ:D» 
tR— h (readO 1 ine-addr, . . . , re 30 
ad7 1 ine-addr) iZMLXOS'OOMWVy 
t?7y7%pJm\Z*TZ> 9 :©7-f>Ol7<>val i 
deify MSttf—h (1 ineval idO, . . . , 
1 ineval i d 7) *&^£Ottfc«l 8 8 6\zmil 
Sns, ffrSKcfcoTfct 7<>va 1 i dttfc^U 
1 8 7 3©«e*-hfc, tf-h (update-1 i 
ne-addr) *65<>val i dttlB^'J 1 8 

->a«W^nyir 1 8 7 8 

[0 6 3 0] ^yjt(Wl 8 8 6tt80©^yH*W* 40 

K^^cfcoT7i7irX$n^^-f >G)9tf**:V 1 8 

a£>© t a g_d a t a A*, aft^JsSStlfcnaiJ^^U 
7 YV7,09tf7 \*UX&\mz>ftiib<D t a g_a d d 

■WFE-I*W» (dcc_jnode) *3tt&£fc»0D 



&7-f >v a 1 i dttffi*3W-Bl*fc»© 1 i n e_v 
a 1 i dA**J$oT^5. fcMftWl 8 8 6118^^7 

fi^x&ssbi 8 8 i^n^nic*rbT8o<z>h i t tu 

^£J$^ 0 4^^n/cngB/ : t , J7F r l/X(0^^7Kl/ 
X<h, 4jS3nfc#va^ : EUco5<>7r t ^AirJ:oT 
7^irX$tl^egiCS^)^^^ ; t , J 1 8 7 2cort§t 
**-ai"SB#, h i tS^t^Oy-O^Oy-Ov a 
l i dttttfcfy M 8 7 3**ffl*£tt£. r^JS^JT 

fc?K ^y7KPX^«±ffifcfy h*^rraDT?*-5. 
8oT, ?#7 FlsXO&ik'tZSkT&Vy hfzttZkt 
ttt-nttfil^ £fttt*ifH«« 1 8 6 6 7 K 1/ 

(dcc_mode) SRJTT * - fcT>>r«Bfcfc 

[0 6 3 1] +-^^^aLtt»ffll 8 7 8B+V^vjl^ 
*U 2 3 0K*Sx-^©7^"b^**^I«B&fc*, * 
^7>HB24 7, *^9>HC2 4 8*6©K* (p 
roc_req) <hil£j (proc_ack) ft2W"Bl 

8o3n?©/1>i^SSfc57 FIxX(Dx-^3&«5*S 
tl5. B^x-^+^^^jL^qEU 2 3 0**67^* 
*ytt*HBl 8 8 6 5 -f 

>ichyh*m-r. usnfctyhit (hi t 

0 h i t 7) IwMLT, ^tr>a8Ml 

8 7 8H3R— h (cache_read) 

>^©K*a*ft^riBK:r«. tyha^ (h i t 

0 h i t 7) Tttfc<g:£ (pro c_r e 

q) 1 8 7 6#fflSftfcl$fcHU fe££nfcM# (ex 
t_r e q) tmZT-PO** y y^H >©ftffi* 
*U7Fl/;*#*«**yfc2SftS. £©**y->j. 
5-f >I4A* (ex t_da t a) ^Rlffift^ 
aCT+^^^i^^EU 2 3 0©8OCDA>^^#^2 t 
*tt£. ^«B*)9>f>7HlxXfiE)^^ 
*U 1 8 8 6\zmg&&tl, ^(Dy^XDy-OVtm)! 
yH 8 7 3«t|BASW. 
[0 6 3 2] *t^r>a^ : Ey2 3 0«)8O(D;Oi7^ 
&©X-*MU x-^*-«f^<if 1 8 9 2K££U< 

-9K*rv h 1 8 9 4fcffi*#tfS*l*. i&fcftfe*- 
PTf-^t-JJWlfl 8 9 214, aft**- Hfl» 
(dcc_mode) ££j*3ft&*MM*y 7 Kl"X 
©/Vfh7Fl/X (byte_addr) *fflV^*fc 

br*h7-F*S8lfyh7-FftWk ttWrrsct 

^ot-HTx-^^-^^^i 8 9 2 
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[0633] mmv&v>wm-zftt>nz>. 

[0 6 3 4] 1) yDt'r»^l-7hlS*t->->J 

mmmi 8 i 8\z&z>7n±yz/>>?3--y y^y*- 

[0 6 3 5] 2) 8O07fWMl-'>hl88 1 

[0 6 3 6] 3) 4fi£^nfc7Hl/7>(7)^^e«3* 
-hO^^^^'J 1 8 8 6<D4-7ay$\Z-k-7-2tlT 
^5^^7Kl/Xi:tfc«E$n. 8^co4fi£$nfc7h*U 
7. CUBITS 7-f >85lr«t-DTffiH^te.n-5. 

[0 6 3 7] 4) ftie^-tL, •5-©7'f XD^'f > 
val i d^ffil 8 7 3^W$hfce>, g#Snfcx- 
^te+Vyi'a^D 2 3 0K#ft?-5£*fc;*tX5. 

[0 6 3 8] 5) £&LfcV>x-*«ftS5A*7. 18 9 0 

©7"n y i> * U frZOr-t'y'f >©f*J& 

U 1 8 8 6K:#£&£n> ■f-©7'T>©7'f >v a 1 i 
d«S8l 8 7 3**HlSn-5. 

[0 6 3 9] 6) ^TCgjfcx-^+Wv'i^U 

2 3 o cs&ftitf. ^ti^ftfe^nfc/t^-y yw&r 
7u*.y=s>if3.-y y\z$inz>. 

[0 6 4 0] tMLfcilO. 3^Dt-yy-2 2 404T 
<DBft (02) «&ipCBu s-f >*-7x-7.3 0 3 

(02 0) €r^#T^5. V ! S3.M®W>2 4 

0ttfr>j2 3 0©8($CBus-f>?-7i-X 
WS?7.*©SMi«U #®B©B4 2^e.B4 6 *TIC|E 

0 ©feSfr&Wrs. ffi#©fc«>> 2l<r>VV7s5 (b 
a s e a ddres s £ b c c mod e) £:tf&0 

1 5 3 \Z^t. 

[0 6 4 1] f-^+Vf>ai»»2 4 0i:f-^* 

ttfilO^T©** y ->a >*«S«&fcUT©^- H 
T!8&fFt5. **#^©«Et)0»Ctt, T-^*f-/->a 
WW 2 4 0 £** 'r>a2 3 0 ttVOfcWMWPE*- 
H»cWt)#to*. " Inval idate" ^E-Ffclfc 
Ufc^TT©*- Ffctt" Auto-fill and 
val idate" tMtt-f s s 3 dc c_ 

c f g2l^**l;:ifcry hfc-fey -KTt'cfcCJ:!). 
it©+t^>a4b a s e_a d d r e s s 

£©ilf^©ra, J*1?>,F*- ^*"f1f.B, C2 4 7. 

2 4 8A^©7 f -^S*«+±SnS. *^y->att£ 
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7.£fi£SSl 8 8 lWftWt^VjyUZ&tiSJlL, Affi 
* =t U ViT-€-tl*M * U * V y 2 3 0 fc# 
ft-f-5©fr£)8fr#3. i*©8*f-?)^*ty>a 
2 3 0C:#fcLftW8£\ A^-f >^-7x-XX-f <> 
?2 5 2*>67*-^**K*Sft*. jaWWfloBWWB 

n*. 

[0 6 4 2] FJI*ttfcB#fc#U l^®f-?7-ff 

10 X©&3©3 2Mv hfctfcUfSi'SC&.B. IfiOr- 
9\tXM >?-7x-7.X--f yfMWmrj* 

b. S/>^a*-jKfiai«He»qE- H 

1 2 F/Vf F©7 KkX^STtb^nS. 06 0 
K^n-CV^lC. S*f-^7-ffAtt8f7h* 
7-ffi*{lt?fe5. 1 2tfy F7FWUi7FkXfe£ 
SB1881©inde x_p a c k e t \jj \ZXii £ 
n, 8 0©7HPX4JS8I51 8 8 1«0 9 6lC*StlS 
20 »5£©3 2h>Ffta&**y7FPX*£j£-f-5. £© 

4s£snfc7K^x©A*>^. 7-f>, jwyryisx 

ttSU 2 £06 llCJ;-3TftJe)e.ns. *MM*U7F 

«MKsn. fniiRAMos^o/o^c/H Fsirr 

2 4 2ltJ;»3^7>h*^— ^-flfSE^. 06Oi^ 

5fc»K7^-fexsn*. ±x©->>^;mi?3-)K*7 

-fflr— ^Hi+^y-^a^tU 2 3 0iOR*-5fcJe>, 
30 *s> if )V1iy y&mH?Z>mz><>{f)VtiiJ] 
*7-fflf— T'M+tfS'a/t'J 2 3 OKD-FT 
5©**M*UVi. 

c. v^ffl^j-isfesra^iftt- f 

dO^E-FTM*. 1 2ty h»7-H7Hl'X***Tt?> 
H*-*f*"f 1f«B 2 4 7sfc$Stt&n*. B#T-* 
7-f xAttH 6 2 *#HbT*»a&b& 3 2 F# 9- 
tBTJffiT&S. 1 2£y F7FW:*«7Fl':*4f£ffil 
8 8 1 © i n d e x_p a c k e t A*fcA7J**U 8 
o©7 F l^£»£S5 1 8 8 1 «, 0 9 6 K*3*l«#5i 
40 ©8 0©S&$3 2 \*y F0-«5;**U7 Fl'Xfcft*. 
nai^^UTHl/XO^-f >£^y7Hl^Xtt, SI 2 
£06 3 IC«t^T*^$n*. ^ffl^ : EU7HP'Xtt, 
06 3*#MUTt9iebfci;5IC, 7tfy h5-f >7F 
WX£ 2 h^^7 H l/XK#tten<5 9 h7 H 

^y7H^x*t56^.an^ofc<i-&, A*'f>^ 

-7x-XX-Ty^2 5 2 (K2) '«»6«Wft^T^** 
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d. J PEGSFf^-H 

-^Vl^EfttCO^Ttt J PEGff^K-F (Si 
4, 16) ©chC^KJB^SttTV^. 

e. fSoiJ PEG«^E-K 

f. fT^JSff^-H 

g. Disable d=E— K 

-f yf2 5 2K:/1X£tt3„ 

h. Invalidate m$)lk) ^E— H 
rot-FTtt, 7<>val i dtt«b:y h*^'J7 

So 

[0 6 4 3] 3. 18. 7 AM>^-7i-XX< 

02t, A#^>^~:7x-xx^y^4^ir;kt- 

#:Mlf«2 4 6, T-m*r>a«W2 4 0, # 
^ffiffilg82 3 S^SORjRx-^ftWMS'r^faailfeRfc 

-AJM ; E ) jaW2 3 6fc&»fc7h # lx*i:x-\** 

[0 6 4 4] AAO^-7x-X^^f2 5 2 tt^ 
-X7h^X^b<^*Xh^^UV^y(C^§^^U 

2 oa©7K^Hy h&&m&tz&, zn\$^~- 

««*6©«*fc#L/T, A*-f >*-7x-XX>fy 
^2 5 214, Srx-^©HiS7FlxA©±t£6br>y h 

[0 6 4 5] H*©±tt61fyh^0Tr*«»^tt» 7 s 

[0 6 4 6] 3^D-fe^!K!)iES&ffia»4, 
1) nyn-byi}-©^-A7Hl/X36^6*7-fey FOx 
0 1 0 0 0 0 0 0*^tt^^-^>?-7x-Xtffi 

to 6 4 7] 2) n^n-fey^o^^EU^^xjr 



If 



[0 6 4 8] ^S^nyD-fe^iJ-OtftBSrmrB*^ 
A^-f>^-7x-XX<^5 1 ^ct:DX7-^l J ?i$tl 

So 

[0 6 4 9] PC I AX^nyniryy-co^^'J^^v 
x^ h*^ft§««£Jino7 FUXOr-^y-Xtft 
So A*1'>^-7x-XX<-v«I*7"?^PC 

1 AA*> <Z>*\ -^ntfe— jR-f >?-7i" 
X^5©feO^*E I CK&J^S/ctf) i 7-X{f^£ 
fflV>S. 

10 [0 6 5 0] 7H lxA«#fflS©». jEM§&B#ttB# 
fcl Bu s<>^-7x-AK£2!3ft£. EICfcL 
MCtti -ac k«^*«ffl*n&Wp, AM>?-7x 

;p*-*^-nf«^«tow»an-6 i -oeflr#, 

[0 6 5 1] AW>^-7x-XX<7f2 5 2J4fc? 

7x-AA-f y?-©K)ei/^A^fc*Stt1l|tfy M4, 

So *ft<tfc2o©H«^7CfcHC«*IHffi*«4«. 

[0 6 5 2] l^C5V-AJrit'&JC7i7irAT ; &^ t7 F 

pJteT;fcsfcg>, x*-f >^-7x-xm y^fiw* 

Stlfc7 ff -^©7Hl/Xt/t-^hftftB*b, S^tc 
S&fiW&S. **V-*fc^B»8©*T?* IBu 
40 fcS« 

[0 6 5 3] HI 4 BfcHMH >^-7x-XM 

2 5 2 ^7f2 5 2Hg?CBus-< 
>^-7x-A^l/> ? A^7r<;V8 6 OA0MC7KI' 
*«#88 6 3£pJg&aS8 6 4<DKtC2oOI Bush 
7>^>-A6 6 lftj*^. 

[0 6 5 4] THwwsesttw-k**-^ 
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g;R£ I Bu s h7>y™A6 61i^IBush5> 

z/-rte 6 2Kearrs©*£ft»*. ©ft*if47n^ 

[0 6 5 5] IBush7>y-A'8 6 1, 8 6 2 tt, 

^CDiim * plfleiCT 5&»© h h ©Ay 7 7 

-'J^WIESWbTHS. 

[0656] 3. 18. 8 D-;fr;M*yS!H8« 
H2C*^T, U-1i)V*^)$mW>2 3 6tt, D-# 10 

fi^-fSo n-*;M ; EUIi!«*2 3 6tt, 

:Mlf 2 4 9*5©»*a*»*t A*-f >?-7x- 

K, «ia-f >^-7x-X«W«2 3 7 i:31«<0-iec 
Bu s A*36^©»*lHbt«*ji*B*fc»lxT'bJS 

«*ft7rS&86fcF I FOAy7rtRffll/TV^. 20 
[0 6 5 7] *5Wifc43V>TI4, 77"XH>-7r 

-xh7£b (fifo) nvyyomz, *^)7V 
x h y-f ^ 5 * 9 * * u Mfiffi^ v>s nx . 

[0 6 5 8] H146I4, *«*»©» 1 ©*««!;:« 

y£B*SbTl>S. ^©laJgSKte, «>J7K19 
1 0 ^©7?t76*Stt« 2 0©»£&*#- h 
(Al 9 44tB 1 9 4 6) 2t>©(fc*tHb#- h 
(CI 9 48£D1 9 5 0) *t**tlTl>5. g^fflb 30 
*-H 948, 1 9 5 0©x-^AXt4gi]<g©F I F 
01 9 3 6, 1 9 3 SgftWtUTK 1 9 1 0*6 
HiT<*©fc:*fU 2^©«Sfc*#-h*S©T-* 
AXMSUH©F I FOl 9 2 0, 1 9 2 2£&D>#fi 
ftffll 9 1 2SAT/%U7I/< 1 9 1 0CM*3. + 
*6IWl 9 3 2tt, m^-^^^yHlOA,© 
-f >^-7x-^fc«»3a;4T©3 > h n-;m*Stt 

Oft* tmz£fr<Dtf- YJZ^L7s ^ilt^c U7lx 
y5o.#9>*l 9 3 4HU 71/-T 1 9 1 0 ©fc 

» t^-f ± $ v >? * * u © y 7 v v *s atM * jvafts 40 
ihwbbi 9 3 2 t^iccn^siws-r^o 

[0 6 5 9] J?£b<»4, «»J7W19 10K«t 
*x-*©«*fflbt '»*&*#- M 9 
44, 1 946*5F I FOl 9 2 0, 19 2'2^ A 
&FIF01 9 3 6, 1 9 3 8*6K»Hll/#-H9 
4 8, 1 9 5 0^©teS©2«©^-hTffxt»tl^o d 
(5D»*/#*a*i:K*IBt*-H 9 4 4, i 9 4 
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&S^ : EU->X , rA©zKh;U*y^T?*5) *pJIB&|fi 

[0 6 6 0] x-^14, b 1 9 4 4, 19 

4 em^ftDd^lftlt^^'JTK 1 9 1 0(C#^ 
&£tt£o M 9 4 4, 1 9 4 6 

ftfcBBtt, ffl»HHfO©F IFO 1 9 2 0, 1922 
©**B»fS«PtC&5. h 1 9 4 4 , 1 

9 4 6*iIbT©x-*<62itt, FI FOl 9 2 0, 1 
9 2 2^-»l:&5)i\ XttA-A h**J»7-r5*TX 
A-XlCil^VKo F I FO 1 9 2 

0, 1 9 2 2JC«#iiSnSfc, $(»ffil 9 3 2J4DR 
AM'\©7£-feX©fc»©ffi©#- h£©#«*fr3. 
7£-feX**#Stt5i> x-^t4Si!5l/-hTF I FO 
1 9 2 0, 1 9 2 2*5K*HlSn, ^'J7W19 

1 0 l£#£&£ft.5o D RAM 1 9 1 0 ^©A*-X 
Sra*U-^f^;W4, FIFO 1 9 2 0, 1 9 2 2tC7 P 'J 
■feyhanfcfc©5 s -^7-K3ft«K t iL6nfc»^ Xti 
•*ji*#-h*6©/1-xh*J»7bfc»'&©*fcll 
*&3n£„ V>t*ft©S'&K:43V>T i b, D R AM 1 9 1 0 
-\©A-X httffRlSn&^jft^SJi*, F I FO 1 9 

2 0, 1 9 2 2#SK&£*, Xf4ct 0*V^5<*U 

T-^^-h*^©^ ^;«**«*«»*-iriK<. wrti 

©'f'<>M£i3Vvrfex-*tt* F I FO#9fem"3 
*> X»4£ft©A-Xh**l*7U *rfc&A-Xh*iB! 
jfif S*Tf> »*ii*#-h*6F I FO 1 9 2 0, 1 
9 2 2^SB**nS:<IIKtfT»#ii*ft«, &#©4§ 

£rbvv\*-x M4, Ettt©/t-x h**F I FO 1 9 
2 0, 1 9 2 2^lCbTDRAMl 9 1 0tC#£&£ 
n§£Tt4iifir£tt&V>. «f#©«3ictt, g«J©7- 
WIFO 1 9 2 0, 1 9 2 2*6K*MSnTDR 
AMI 9 1 OKSSasn^^x-^fcSjJWBIS 
ft*. F I FOl 9 2 0, 1 9 2 2*6©t-^62** 
htftSftft, t*a»#- H 9 4 4, 1 9 
4 6*** h-;i/T5©*4, ttflltt 1 8 3 2 h 

-5, ttii^#-M 9 4 4, 1 9 4 6*6FIF01 
9 2 0, 1 9 2 2^©x-^te«^r§Vi*^^SJt) 

[0 6 6 1] ffiJMBb#-M 9 4 8, 1 9 5 0J42!© 
«TBMW K^tBb#- M948, 1950*^ 

19 10*tKSn, »^T^FIF01 9 3 6, 1 9 3 
8Cf-^*tSatW. «a©x-^7-H««F I 
FOl 9 3 6, 1 9 3 8 K^&Sm^^fc^, ft* 
fflb#-M 9 4 8, 1 9 5 0\Z£Z>WtfrtilL1)mm\Z 
c©±^feftfij©r^7-^4#§fcti^il 



( 

191 

MS****«*\ fill&tii IF I FO 1 9 3 6, 1 9 3 8 
left fcttHtofflL-tf- M 9 4 8 . 1 

9 5 o*t-*na±T-**»*b&<&-3fc&i*7-r 

«. -ficOct^irUT^UJU^T-r^i:. FIF 

0 1 9 3 6. 1 9 3 8^7^ -fey h3ftTV»-&T-*7 

-h*©#»t&$a*T£*3;TffM3ttftu. -ss^ffl 

bTp-h*^ ^;H&»7-rSt, FIF0 1 9 3 6. 

1 9 3 8Ca^TV>«V^ft*r-^fc0g**n*. 
[0 6 6 2] #lCDRAMn>hO-JP^ffl!/h<a^±[Hl 

SJcSlefSfc*. T'Uir-y K3ftTV>-5&©r-*7 

-H*^Te2isn«.*-c W4, *fj&-f5F ifoi 

9 2 0, 1 9 2 2 j&t&Cfc-Sj&V »H*tb IFIF019 
3 6, 1 9 3 8**-Wt»-5*t?) hfltSflOfc* 
3nfcV»«fc3KDRAM7£1*;^©ff#«ttfWBB3tt 
•5. 

[0 6 6 3] £T©7£-feX#-h 1 9 4 4, 1 9 4 
6 , 1 9 4 8. 1 9 5 0 tisttl^tllZtt&t&rt-X h 

1 9 4 2K:7<.y^3ftTV>5. Z.V>1jV>9 
I4#- h Cfcff 5® 0 5I€?©&*©;<7 1/ > h 7 H V7, £ 
«M»l/T*D. Oa*SnTfe, VOTfc 

iEtV^ ; E'J7HPXT^Wrs«P*tpIffiT?*§ 0 £ft 

7!?7-4yiS.T> R AM1K ^;P©7 h* 1^7. ©***#•! {fc 
SB 1 9 4 OltiOiiftSft, fT7H^X*'J'>^ 1 9 1 
6tJI7l<l/X*«»^l 9 1 8fc»&n*. 7FI/X 
©MNtfy h«?»J*7>^ 19 18 fcAfcSft, 
©±(S7 Fl'Xh'y h(4ff*7>^ 1 9 1 6^A2j3ft 
•5. #Sfcffl$ 1 9 1 4 14. DRAM©ff7h*^X^'f A 
©Wltttfr*7>^l 9 1 6*&*%U7l"f 19 10 
^f57H^*ffl*b, DRAM©#I7 FV'X^'f A© 
IHCItfl*9>*l 9 1 8^6917 ff7 
HWX*7>^1 9 1 6t?«7F^7.*7>^l 9 1 8 
14, h©SJ&^C*ViTt>^ ; E';7W'f 
DRAM1 9 1 O^D-HStlS. HftBU #-MK 

*IC^Ttt**^Tft*. J!I7FI'X#*>*1 9 1 
8 14. ■€-n-en©^ ; EU's©45SI*«iSSfc«lC-i' "J 
*>hStt, frTHWObfr^l 9 1 6 (43*97 H 

9 1 8AHtoCXfc«fc<f ^U^hSft 
*. «tO«*!C«/t-^h*»||7Sn. fffcfcfr7F 
W 7.Tf?W 3s txfc tttitfft S ft w 
[0 6 6 4] *HISfifiIT«. X%U7W19 10I44 
X8K?WVf h9-f>*^T?*D. 7-Flfc0 3 
2tfy h**MFr*i:«tUTV»*. SIC -5-tl-etl© 
f§a^-M9 4 4, 1 946fc^n4/Wh 
©§£f&*<*-7*;Hl#©-fey h 1 9 5 0 . 1 9 5 2 
#*b* tatfMlc5 s ->*»**07t'-r 1 9 i Oft©* 
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x-^©##ii^lC*H--5VX^«:ffiiclC^ttS*^pl 
m-Q&Z>1ttb. ttfct 5 F I F O 1 9 2 6 , 1928 It 

•5-n-eti©x-^7- Kt*c»#a*-f *-^;m* 

SBfATiK&g****. dft?,©F I FO 1 9 2 6. 
1 9 2 8l4§£j£#-F I FO 1 9 2 0, 1922©3> 
h n -Mzm V><=> ft -5 © t (SI Uffl #T 3 > K u -Jl> 3 ft 
F IFOI 9 2 0, 1 9 2 2^©x-^©#$ji 
3-lC&S£3ft5 3 2 £•> K©f^to0lr4lf«y h©*# 
fflV^ftS. W«C. ^SfbSGl 9 3 Oli^SfbSBl 9 
l 2t|wji;j:e>lC3>hn-;i/3ft5. iM?3ttfc»* 
&*-f*-7\)H4, fMWl 9 3 2^X^3*1, MM 

i4cns©flif**fflK &mm \ 9 1 2irj;o^ : &u 

7K 1 9 1 O^A^ftSg^&^x-^-trlSJ^bT 
^'J7K19 1 0(*J©7KkX3ftfc7-F's©» 

[0 6 6 5] B 1 4 6 ©WSttlWBffi 1 9 3 2 ©fH»T 
T»fFr*>. B 1 4 7 14. B 1 4 6 fc*5V>TfWW«& 1 9 
3 2 ©ftf^©^«€r^-r:KSiBT*5. A<7-7y7*© 
HtU-fey h©^7^flC, WHftlittflttlCI DLE 1 
OOtf&ICftO. d©!Kfi8t?-r^T©DRAM3>Kn 
-;HI**H > >7*:r<r:/ (high) KteO, #£{b 
951 9 1 4l4fr7FP'X£DRAM7l-"f 1 9 1 O'vSI 

u 7 1/ -y •>a*&iwi';«*w»iiisn* <t, 

RASDEL 1 1 9 6 2#6g^&3ft-5. #©:7ny 
£ X y 5?Ttf-f :7 JPgjfc £ ij 7 P y -> itfft < ft -3 fc 
£>. tf&ggtel DLE 1 9 0 Ott&lCM-S. -E-^rftV^ 
t, DRAM tRP (RAS/'Jft-^'fS^ 
MRR) S«3*^fc3nfc^flrRAS 0 N 1 9 6 6^81^ 
S^Stl, ^©Rf, fr7h*^XXhn-^«^RASJ4 
n-|^;HCft5. t RCD (RASd^CAS^©il 
®^-f 5>yiM«) *^fc3n&«. COL 1 9 6 8« 
*^«»atl. DRAM7W 1 9 1 O^A^-TSfc* 
©^7HWX*S^-r«i5lC#fi{bffll 9 1 4j&*X-f 
«y?-3tlS. *©^Ds/^i'TCAS0N1970 
ttffifcg&Sn. DRAM9B7 Fl'XX hD— 7* (CA 
S) M^A^^x-f 7*D-Cft5. — H., tCAS (C 
ASTi/T-^^^-f 5>^WK) ^fc3ftfce., CA 
SOFF 1 9 7 2tt$8N&£3n. C©«fi8TDRAM 

^HvxxhD-^ (cas) \mzM>7#7-4y 

A-T ICft5. tCT, Hft5x-^7- HW6B!3ft3 
itirftoTV^i^ir, it)i«v^7-f*U'r^©-9- 
'fi'Jl'S*^ U7W«y->i*^UiioTftVi*\ 

»wa-r5it»4ji-r^-5Ji^. we-itcp (c 
Asyu?-fr-^'f5>^«!iis) m&mitisnitp, 

CASON 1 9 7 Ottffi^ttJat, DRAUm7\ t UX 

xhti-7* (cas) t4wr;7^f-^^n-cft5. 
tgft55*-^7- HoteSWb:k 1 &teil#&*tB£ 
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-f5>iflHIB) *<WjWt&Snfc&. -t Oft t) OCR A 
SOFF 1 97 4«I'sI^n^. £©#&•?, DR 
AMfT7KP'7s7.hD-7* (RAS) (Uttt-f >7£x 
>f tfA-f *O^D*^I«/i?Tttiga I DL 

e i 8 6 ott«e*»u *o-y-f ^;i/Mtt*«P*"r 

-5. 

[0 6 6 6] RASDEL 2 1 9 6 4ttlT'j7l"> 

->a»#**ttai*n*(t. — s. t r p (RAsyy? t 

-v^-f S>7"3» fimtiZtltz*,, R CAS ON 
1 9 8 OtfSBKjS&Sn-S. COtt^TDRAM^JT K 
l/XXhD-^Ti'f'f T'P-ld&D, RAS 'J 71/ 
•V->jL+»-i'i/;i'©WlCDRAM CAS&fflSS-r*. # 
C^ny^I'r^igHiRRASON 1 9 7 8^ 
frtX. DRAMfr7 FUXX hd— 7* (RAS) «77 
TjVa-iZfcZ. tCAS (CASJ^f-f^-fS 
>^S5PB) tfffitzZn&tm&teRCASOFF 19 
76^frt>n, DRAM987 Fl^XX hn—7* (CA 
S) tt-f>7i>f-f9/Wl:ft5. -fit RAS (RA 

s79T4V9<f $>{fmm.) i>mtzt<tiztmmzR 

ASOFF 1 9 7 4^frfc>ft. DRAMff7 h 
D-7* (RAS) >7^7r--i"!/A-f (CJ&O, 

D RAM1M V ^©£££±12© <fc 5 l3.M2>M^$:Wm 
U IDLE 1960 ttfig^S^-TS. 
[0 6 6 7] 01 4 6®U7l/!yya*^>^ 19 3 4 
«m*SlC*7>^T*0, 1 5V<^D^>afet)lC-|5I 
©B^-K sfcte#»DRAMg#©g#l;:.fc9?£3: 
ofck-hTU7P-y->3.M*«^«:5S4Sli--5. 'J 7 
Is? 5'jLS:fcP%fT3ftS£. £©S#ttB 14 7©tt 
®SC«tt)S4jI^n-5*T?%ff!Kffi?fe^-t«„ £©77 
7H^>HJ, ttWRCASONl 9 8 Otfgg 
lrA-3fcRrlcfft>n» «IIS**U7P-5/->3.SI*©«t* 

*^ar-5 *-c-?-©«ffi*^Jt5. 

[0 6 6 8] 014 8IC«, »^3-H7*-A-CHl 
4 6©#®S1 9 2 4©BW&JfiS3nTV>*. ££T 

tt, 4^©-9-r^;PB*siff*©+rifnc^ ; t»j7i/ 

-f 1 9 1 0^©7*-feX£fra-r-5*>£«ft»5:&ffi£, 
7*"feX^©&¥3 tfitefeftClK ^;UB*«©7*5 

&©3- H»cfflv^nfc«»#;w4H 149 »cKK$n 

[0 6 6 9] *n^ft©£2fi$gfT£tt, -t©S*©7*7 
-f:i-<j7V££-f 4fcfy hftNF?TV»«. ±ffi©2t-/ 
h«-S9:©*J«k^7.^IC|2^3nTV^«Sfi!cffl»C«k 5 
^)KW^7 r 7'f*U7 i >f iryj-fey hStlTViS. 7°7 
-ftf'JTV ©T<fc2 bty Kttfl»&f 2 4fc«tt)S«f$n 
■5 2 tfy K*«7>^fcfej&&nT(riS. .#fc©fe*«& 

tes&ic. ^i9l|(i*i£l-a%© , I^©^ 
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©7 , 7'T^»j7 L ^^'7>^©<il«i2fDtI^0. |b]-©± 
tt2K? K©7'7-1 , *U7 : -^ffltg?# e kD<S;UTCS2 f 
>y K©7°7-f ^'Jx^M«;ito<i©g*#©Tfi2tf-v 
h © 7" 7 -f * 1 J -r -f * 7 > h tt £ T 1 1* -P -f > £ ' J ;< > 
h$n-5. £©£*. ^t'J7K 1 9 1 0's©7? 
•fe^t« : BI3nfcK*#»4n-©±ffi2 f-y h77f* 
U x-f **ttoS#tf©W|-C«fcfilr>7"5'r * U 5r-f IC 
ft*. ±tt2 t*y h-©7"7f ^UtV flMMfittiio 

10 S£g$tlftK 7"7^:*'J7V©±&2 try h©ffi(iM 
#f©£tt«fc:/5-f:*y:r'f Tffi2b*7hO 
Itt^-©±ffi7"7'ft l Jf^ ©»*#©IBJT&¥fc# 
&7x*-A£§|:glbTV>3. iOXt-ASfflViSid: 

(-t-n-en©s##©±f!it2 if •> d^gB 

V»*», -gB©±&2 My h7 p 7'f t'Jf-f a<ffi©£gfe 
3) . l&mzftW-llAtimx. (£T©±&2h*yh©7° 
^UT^ffiiWl-) $-C©V>3V>3&#&X*-A 

[0 6 7 0] si 4 9 ^-n^n©^#icM-r-57* 

£•> h©®jg<!:^©l:-y h©#JJH8;£^b 
TV>&. CCTH Ml 4 8lCfflV^nTV»S->>Jl?;i' 
©jS^fc^StlTV^-5. 
[0 6 7 1] ±!2©mS0!|-r#a©F I FO 1 9 2 0, 
1 9 2 2. 1 9 3 8. -tft^S 1 9 3 6 &ig3 2 tf-y 
K 3 2 7— KT&S. r©a?Stt^)$t^S$n 
5[IIg§XU70fffl©&V»&T©gt8£#;LTV>-5. Vfr 
U SS3©tett, /17*- V>X©&fci:#l;:t$5£©7 

30 7"'J 7— >3 >©--xic-g-fc-a-T^x. ^ns. 

[0 6 7 2] £&. ic:cs$nTv^4#-h#ij^n 

ttS$ji*#-h©vi-fn^t©nt¥-©F i foa* 
-y7r*ffl«-r5fcntTt>sa*tt#sti^. l**u 

[0 6 7 3] 3. 18. 9 ffit^a-JP 
ffitya-f2 3 9tt. 37*D-fe5/-tJ-2 2 4©S!lfP. I) 

40 tor£tt±<&x5-tSJ0ji*.ffi#©T;^7'W^-> 
CBu s©rtJBt^«7*-Ai©M©'f>^7 
x-S^a^lWfc-fcB u s «#©-j»!/n8BC b u 
s ffl* tf y^WVJ-ifVir 'yyflt ZvltibO!? n >y 
^©%4iS«*fT7. ^)i&<l6 ; E:^a-;P2 3 9©Sdf^ 
tt» fflVi&n«AS I Cri; juv\z±z>>;uy*>>f 

t0 6 7 41 3. 1 8. iti ^.ffi-f>^-7±-^^l 
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[0 6 7 5]#tc, r^uiry+tti^x h:/niry^<D 

^r/WX^£tlT£D, 3yn-feytJ-t«tOt*S 
ttfcfft^7KlxXS*X hyn-tr^^cD/^EU^^^JS 

U^a^UVylf >^xA^Xt4, #774? 24 

[0 6 7 6] iMR-f >*-7x-X«HI* (E I C) 2 
3 814, n^D-byiJ-OPC I Bust-KBusA 

*x h s/xfA ©%i7 f i^am t ©ra*o& o< * 

«2 3 8tt, AM>^-7x-XX^f y^2 5 2^*> 

-f\ JB**-3tf^--f1f 2 4 9^608*C|SI;t*X h 
U Kx-^fcS^iitfRfKPC I Busl07X 
^tUTfHBrr*. PCI Bu s^©7^-feXWU 
"PCI Local Bus Specificat 
ion, draft 2. 1" PCI special 
interest group, 1 9 9 4(Og2ptC^ 30 

[0 6 7 7] >^-7x-X$ffllR2 3 8*4, A 

M>^-7x-W7f2 5 2 i«f**-^>rif 
249A^OPCI »D3l*©&»©W«fB*t#«r 

O+^yS'A^-f >K*IHL*\ *Xh©lfr<!:2fT© 
IH©*irv5/*9<>©*»lflUi:, 2frXtt*tteU: 

«E>ff«&^na'f>^-7x-^M«ia2 3 8lrJ:0 40 
ASStl*. St**>f>^7i-X(««2 3 8 

[0 6 7 8] *»f >*-7x-X*M*2 3.8 0«flS 

«^ ; eu^e > *xh^s^q6u^7Kwxv^b!> 
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©PC I Bu s^0?l^7Hl/X^Mt7t 
#rt*PC I Bu s^R0 3l£&B#T5t>3-o©7 
-fyf2 5 2^IS«*-^>f1f 2 4 9a^ffi*$n& 

[0 6 7 9] ^t'j7^>h^F (MMU) 
(01 5 0©9 1 5) tt, 1 6m<D)Uyi7 7^ KAy 
77 (TLB) atS*fc&oTV^. TLB*4{£g*fft 
a7Hl^XV^t!>y©^y ^a^bTfP»r*. T 

L B T14&© J: o fcffsman*iKTafc. 
[0 6 8 0] 1) ttlft: fiaB7Kl/X*»#*.5tt*£, 
TLBf4#*r -6*317 Kl^X^TLB5X«-t (7 K 

i/xfev^^-rswaftx^huata:^*^) ©v>t*n 

2) «*:TLBfcHU K#X> h U*#»TfcV>X> 

[0 6 8 1] 3 ) Umt : &3®7 K VTsifitt £tlfc 

[0 6 8 2] 4) ^j|«lft:t^T©TLBX>hU* 

[0 6 8 3] 5) «E*fflL : TLBX>hU©fiE8^* 
S7Kl/X*4> 4fc*y h7FI/X^-XT^ffl£tt 

[0 6 8 4] 6) TLBX>by©{Rffl^« 
S7Kl'Xf4, 4Hy h7Hlx*^-XT«#a*tl 

So 

[0 6 8 5] TLBrt©X>hUttm 5 lfcS+.fc'S 

Utt* 2 0fc> h©<K3g7Fl'X6 7 0, 2 0Kyb© 
•«7 H IxX 6 7 1 , *tl*6#lW-**a^-^«* 

*a*prtB*5*ft*-r7?^«*aft*. x> h u 

S?X^4, tt*fc«V>&ftfclOK*h*T©7Fl/X 

TTLB©^-y»44M/t>f h*Tf**- hStl*. V 
X£l/$?X*ttlO©*T*«5&»* f^TOTLBX 

[0 6 8 6] TLBfctt, "Least-feeceht 
ly Used" (hkii) fe&7)te*)X&W&Wt> 
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[0 6 8 7] HI 5 2ttTLBlkftaf^Oflittl*S-f, 
S^S^nfciEST Fl/X 8 8 0H8 8 1-8 8 3 ©3 
O^gB^lC^tt^n^o Tft 1 2 fcfy h 8 8 1 te^lC^ 

Fl'Xtfy h 8 8 BM^l^ HtiSSftS. #£>1 0 
©ffi^mvfft*vT?*5. l^X^ 8 8 7 fttf) 

1 0 7F^Xt:y M41 07X*h> htUPijJV 
It "AND ED" TLBJb^T^y 
CO&ftfcTtt 1 0 tfy Ftf><K$^-i?#^8 8 9 
§o <ES7 F UX£>±ft 1 0 My h 8 8 3 tt, {gg<- 
8 8 9 <£>±ft 1 0 tfy h bT^-f h fcjg v> 

10 6 8 8] ^©J:5fc*ri6Stlfc2 0 tfy h<Dfcm< 
-i?##ttTLBlc:S6tiS. 2in*«x>hU©ioi: 

TLBtt»lStS»I^-^8 7 2 £ 
-»bfctt«<0##*fi-r. ^317 F WX 8 7 3 tt, V 
7s$V*J7s9% 8 7^S^ffi^T%g^-> ? S^^4 
fig$n§« *S^-y##8 7 2 0±ft 1 0 My Y\Wi 
87 F 1^X8 7 3©±ffil OKy hiLT^-r hit 

«317Fl^X8 7 2O&C0 1 Oty M4* 

(#*-r*vx^br^h*«ifl!>»d) * 

<KS7 F l^X (VX^ h*y hjjt 0 ©#&) 
*>68 7 5C«lStt*. «S7FlxX©Tttl 2h> 
F8 8 StifiSBTKl/X^S^-f Ix^ hfc^AStt*. 
[0 6 8 9] 7^5 1 l:ftViLRU;^7r8 7 

tO 6 9 0] TLBSXtt, A*-f >^-7x-XX-f 
yf*2 5 2^»**-if^>ftr2 4 9A*TLB 8 7 2 It 

£<E>®3\ MMUtt»*Stt&7£*X«!>»l 
SttftSttC* *X MtU 2 0 30^-?f-^** 

t0 6 9 1] ^-i >?f-^tt*Xh^^>^ ; EyOA 

>r>ar-ym§. *n*tt©<*--$*T--:/;i'X> 
HJtt» HI 5 3fciR?%fc5&7*— h02:3©3 
2bfy h7-FT*lrtSttTV>S. 2#B07-F»4« 
a7Hl^X©fc»O±tt2 0liyhft*lJ*U Tffi 1 2 

tyhttwasnxv**. *f«f«<R<B7l«t/x©ltt 

2 0 My htt«0©7-r l*fc4*.&*tTt>*. tft 1 2 



[0 6 9 2] ^-yf-7>X>hU It 14* *£#HtT 
LBX>hUtHi;»«*^*tlT^S. ^-i?f-^ 

IH4, att^O^^EU 2 0 3ft<9*g&©^-v(tb£ 

[0 6 9 3] MMUftf4, 77 F^x7ltcfcD^££tt 
fcl 6 CO^-vx-^l/T^O^ co-try Fi^snTfc 

te, 3^DtyD-2 2 4*«6 4KM h^XO^-y 
f-7>^#-hl, 8K^-y7yt>^*ttS 

Att£V>T, £n*4§*3 2M/Vf h<DVy If >y$tt 

fcfiS7H^xamft**-r« 0 frb^-v^-:/^ 

[0 6 9 4] TLB3X«©MMU*ffsWU It 
20 HI 5406 9 OCSITM, 

[0 6 9 5] 1. TLB CflFaL/&V^fi^-y#f 8 
9 i±oy\7'>a7r>^ya>8 9 2^ffU ^- 
yf-^Kl 3 If y h<M>xy^X£to£T£. 

[0 6 9 6] 2. ^f^JK>T7W8 9 4, 
8 9 6©±ffi4fcfy h8 9 4*JBV>, 
< >*8 9 5£jgiR?-£« 

[0 6 9 7] 3. 2 0h*7h^-^f-yM^f>^ 
8 9 5 M-yf-/M>f^X8 9 6©Ttt9t 
y h £»JSU ST& 3 If y Ht 0 0 0 «R£?-* 21 1 
30 fccfcD (^->^7 L -^X>hUJ4*Xh^ ; EUrt0 8 
/Vf h££a&£&&) , B*Stifc^-^T— ^x> 
h U 0®W7 F kX 8 9 0 

tO 6 9 8] 4. ^->^-7^x> h U ©$!3S7 F V 
X8 9 8jfc6tta&, *xh^tU*6 8/t>f h«K*HJ 
To 

tO 6 9 9] 5. 8/Hh©^f-^X>hiJ9 

0 0*«PC I /tX^SSftfcfc*, VALIDfcfyh^ 

1 lC"fey fSnTV*n««B^-^##»TL B 5X* 

40 7Lfc\,*t. ±E<O^DirX*fflViT^O^-$?5"-^ 
;PX>FU*<7xy5 1 $ns (•«7Hl/Xtt8/t-f h 
T^>f>^U^>hStl«) . £<D}m\tVy 

g^-^#^<D^~^-7;ux> f u 

t?» ^4^a^^-v ? x-y;ux> f y it a 5 st^w- 

e>n^o «a»a:<-?x-^x>hyiaiofc«^t 

tt, ^-^7*-*hx5-««BBSti»att+Jb'r*. 

to 7 oo] 6. Ty^-rftfiait-^^t^ri"*^- 
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[0 7 0 1] -ttl*5TLB©it«W*Wllrffffotl, 

»«#pJteKfc*u «fbv>x>hu*«TLB^*€riis 

tlZt* LRUAy7r 8 7 6ttSBR2n£. 

[0 7 0 2] E I C2 3 8lcmS*tlTl^Ay->i7 
7>£va>8 9 2H, 2Obi7h©i0i^#f 

(vpn) KfcfU ^©^SjiSffl^S. 

[0 7 0 3] index= ( (vpn»Sl) XOR 

(vpn»S2) XOR (vpn»S3) ) &0x 10 
1 f f f ; 

CCT\ SI, S2, S 3^ffli:W^7 P Dy^ApJ|g^ 
y7M (IE, 304ft) tf, *n«en4C)<0ffl*SSC: 

[0 7 0 4] ^-^x-:/;U©«*R**UK/W 

^7^n^ c fc^^^(c^< tblrwmZb (nu 1 20 
1) X>HJft*A/l?l>S. 

[0 7 0 5] 77 h^xT^X h^^Urt©^— 5?* 
-7 > ;i/X>h>J«3 7 p n-fe^^2 2 4©TLBI:+t7 

teiK * ;n£*fc-r z.t\z& d ff ton*. 
[0 7 0 6] «iaft^^;Hi»b^*§l^er-r 
Hyht*K«»flsan*«a^--^*#*u mmu 30 

JBMrfHKB. S»Stlfc«i^-$?»^©fc»fcTL 
B±'T*fcStl*. TLBX>hUK:Vy^f££* X 

> h y Haascr-* an, «^3nfci»w*«!>« 
*fi*wB v^n^ct^icLRUT 1 - ://wns« $ n 

So 

[0 7 0 7] *»^^«JSft^HV^feS*^OT 
LBH*J:DJl5V^5-f*U^ft»t>TVi*. fcttfc 40 

*©*SS*fl3ffil!*«flII6Tf * £ 21 i 
[0 7 0 8] MMU^B^anfcfiSTKl/^.©*:*© 

^n$^—^7^;uhiv^7o MMu«x7-ii^*ai 

U 7^;H-*firtfc{ESTHWX*77K^x7^ 
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[0 7 0 9] K*Bll/9fflt7-^3n& 

nfcttfcfc^-y^^i'htfttians. 

[0 7 10] ^ffi^>*-7x-*SJ?|ffi (E I C) 2 
3 8H -ttA*X^7Hl/X3nTVi§A*-f >^-7 
x-X*< y^-2 5 2 t»S*-^-f1f 2 4 9**5© 

ao5i*s#(^i;<&n§, w?n©g*t^i-ji/ 

ttS£<8fi#*MttA;*ffi*>*5W4PC I AXfflA^Sr 
^fc AA^f>?-7x-X7>f -;?2 5 2 t»St- 
#^-f1f 2 4 9 t©D5^-y 3 >CftIAXSffl 

u^><Dt\muo, -kaxb*^©e i cai^ttpc 

Ig^CDS^ir^tC^nTV^o Hl-E I C 2 

3 8H -H/USBi:^ l^i? H:7 K l/X C b 

[0 7 11] HI 5 0f4, *V«S>f>;5r-7x-XfW»«R 
2 3 8<0«Hfi*5RbTVi*. IBusS*li#tft«9 

i osao, ^Sftgp9 1 ott»*©awffl!*fetici/ 

T (PC I Sfctt-jft/tX) !Sftft»t5;a-;KR 
H8/U^I*lt RBustCBusfe 
J*oTV^HRAX«flHB9 1 l^SlftnS. RBus 

#*«v>*fc», d©/«k:tt**ft»^i&Ktsn& 

[0712] PCI /U^i^nfc I Bus I B 

usH-7-fA (IBD) 9 1 2lCJ:oTftt>n*o l^4£ 
tC, PCI^CRBusI*liRBusk>-A (RB 
R) 9 1 4fc£oTjfi3I3ftS. IBD9 12<hRBR 
9 14IJi£S7HW^ •«7HWxeiS'r^ : EUv 
*^>ha-yh (MMU) 9 15K12I*. IBD, 
RB R, WSMMUH ftlftlPCI h7>1fi7 
^3>*B*t?*T, cn^HPC I VX^t-FftW 
SK (PMC) 9 l 7KJ:^T^rtan, n>hn-;i/£ 
n«. I BDtMMU»4PCItt*fflb©»L*K*U 
RBRttPCIftaW^*t5. 

[0713] w«opc i^^h^-.HMii; (p 

TO 9 18*4, ^-yy htl/T3^nty*^7H 

cn«cB u s vx^^ Ptt^mm^SD, 

Ir^TOfo^&a—n^OT&tZ&^&iZt*. PT 
C(4, S$nfcCBusf-?*PMCg*7?PCIA 

P c I r-^/Ut!>©a > hn-;M4 
#-©y-x*6ttian*. 

[0 7 14] E I Cl/^X^ir^^a-JM^U^ H 
bXan^CBu s h5>1f^->3>»4aHPCBu s-f 

>^-7x-x7iaoiat>n§. ^©-y-yt^a 
-;n43>hn-;n/s?^^sify h*t>6vv 

4 >>-7xK3<W^fetftii, I: Stllill 
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Tr{mt%/V)T'<3Lf&t3 L x.y2 (PGC) ^Eva 

-^9 2 lKckoT&gsn^o ^^nrc/vjT 1 ^ 

[0716] HI 5 5tt, il50(DIBus 
9 1 2 ©«jfi*^UTV>-5. SttAttfc I B u s 7 F 1/ 
X 1 3 > F d-JMi^ttlM £ )V<D&&?y y?2tl% 
9 3 o„ ^-jy-h9 3 ltt-y--f^;KOjfi*D*ttffl 10 
U n>hn-;i/D5?y^ 9 3 2 fc|lM&m*f £IS£-f 
5. <£^"^S^^^*r^>^^^9 3 0©±ffi7F 

y#*twu 9 3 et^y^snfcfta^-^ft^ftiiT 

MMU9 1 5 (01 5 0) ^2t£tlSo 

[0 7 17] *g^-^#^iTffi^7FI'Xliy F 
tt, 7X^7 9 3 7 t:«toTIM6dSn, PMC 7 17 

(010 2) ^PC lB*©fc»©7Hl/X9 3 8£ 
*fc, lK^JP©fcfc0/t-XF;S£>F*> 
#9>*9 3 9fcn-FSn*. yU7xy^»fmS 20 
U%i3*?>* 9 4 1 £7 F^5y?itfcMa*9 4 3 

[0 7 18] PMC36^jgStlfcx-^tt, T-*1fi7 
U7xy?©— «**5)5>*STv-;&£#fcF I F0 9 
4 4fcD-H*n$. 7*-***F I FO9 4 4©ffl0flB 
#Tttffll5«fc&oT<£^ 77^9 4 5, 9 4 6& 
STKiMttU n5?y£fcj;0irny*7*F;S*i*. 
K»ffll/U^^9 4 6ttl Bus7^;i/r?^>b 

[0 7 19] t*a>hD-;^Dy^9 3 2tt, Ktt 30 

tl* 6 P M C\©-f 7 x— X 4 3 > F D— 

[0 7 2 0] &S^-S?##;*/£>^9 3 5 a, IBu 
s7HWX*6<D^-^##tfy FT* IBusF7> 
If^g^Mft&AfcD-PSn*. £0!>2Olf?F 

£<s. tiki otfy hfc»UT«t, *n«n©e^h 
tt^js-r^vx^if ? h 9 3 7*t i jc-fe^ fs*itwi 

«£ttAttS7FlxA^&D-F3*U *-3T?fcV>£, 40 
[0 7 2 1] ««©IMIn!, i»^-5>##ttffl»7 F 



[0 7 2 2] ftiTFWH MMU*»t»ft 
*Stft«l:7 7f 9 3 6 0 

^fcfyFH, igaftfc*a^-^##t7C©<K«7KU 

[0 7 2 3] mWT KWX*^>^ 9 3 8li«I7Hl/ 

X77?9 3 6*^o-F$n§o e:tittPMC*^7 

-> ? l^ltSoTV^7"Kco»^2oWTTS§C£^ 
tttBI-Sfc, 3>FD-;PD$?y£ 9 3 2 £«#*HJ 

BK«i;T*ffca7HU^*lftfeK#r*. 

*ffbl>7FUX*ift«fcm*n-F3tU PC IB*** 



[0 7 2 4] A-^h*D>^9 3 9lt h^>if^-> 

3>©JfijftnM Bu s;t-xhttt*i:D-Ksn«6 
h , 7f©^>^')>^t$.§, dtlttPMC*^7- 

#*2^JKTK:&5£* 3>Fn-;Hn>^ 9 3 2 MI 

#stHU :nt2o©f-?fiM, pci ^7 >if 

pNBTfctrMRD) «. 

[0 7 2 5] yU7x^7HW^?W9 4 3tt, 
V^^y»j7x^^OSa)©7-K^a7HWX£ 
*CD-FStl*. «<IBus h?>-!f^^3>^H 
»U ^tl^57 p U7x^5 1 *^>^^fe<£ , blt? 
07-KA^P5<^ , j7x^^$tlfcrt*^b&e», F 
7 >if^ v 3 >©«B©4WI7 F l^X^U 7x7^7 

Fi/^©ntit«*n«. wttPvy^T*** yu7 

i^T-^J I B u s Sl*0 *«fcT©K«V»6tU 
SftlC7 r U 7xyf*n&7- F©«©7 F WTP C 

[0 7 2 6] 7y7x?5 L #£>*9 4 1 »4tf ^ F<Z> 

F#SSn*fctffc**A*F I FO0*3&Hi;#»> 
>h*T-f >^y^>FSft«, ffi< I Bu s F5>1f 

7xy^*^>h^7FlxX*^>^ICM$n, -£na> 

s/x-* h*#>*j&s§i)fr*u pc i»3iwqf*3n 

*«MTllttT**J:3fcfc*. ftfeDfc, IBusF 
7 v £* >**:/U 7 x * tlfcx-^ 
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[0 7 2 7] x-^F IF09 44I1 8 7- KX 3 3 
\iy h<D#Hfl!7*-;WX;k-F I FOt$§, PMC 
T^bCOx-^te, x-^7U7xy?<2~gKT&3^ 
W^tt^y h«l:F I FOCf ^^^n§o FI 

F0A^tt*ffl*niBus^^ft5. 5*- 

* 3>hD-^ni;7i7^^©ff^l fif 

[0 7 2 8] ^15 611 H15 0ORBusl/->-/^ 
9 1 4<D*|jfi*jRLTV^. 3>hn-JH4 2^©«IB 
S9 5 0, 9 5 1 fcOBW^Jy h^n^o 
«HS§9 5 lttRBu s^>^-7x-7^>h 
n-;H"S. 5 2liRBus/Wh© 

Htt, Wh>f*-:/*fc#fcFIF07 5 4fc»S& 
itlSo F I F0 9 5 4Ai3(|S«-rSJ;5fc&Si»#i& l 
*P^;?9 5 ll:<fcoTr-l/7>f«t)iS$n, * 

So 

[0 7 2 9] «€r&*n$?y*9 5 1(4, si^WMMm 
#*^LT^-f >«IB»9 5 OfcRBu s A*-X hOH 

l/XKy hH#£>*9 5 7fcn-F;£n*. <KS^~ 

9 5 8#SSW. im^-5>#5fc<K»7Hl'X0T 
ttKyHlTX^fcltoTWIB^atl* #^>*9 6 0 
fcn-FStU PMC^OPC iS&OlttiXDTFUX 
ftftfiW*. PC lR*©*tl*ft©7-H©fc»©7 ? 

7x-xn > h n-JMS-^feft) >n > h n-ji/D 

S^#9 5 0fc«fcoTF I FO 9 5 4#^Uy#7 

[0 7 3 0] »^&^ttfiiS9 5 lit, r-7HtJP 
*fflV»TRBusyi-^h©*T*««B'f4&. "TS^F 
I F0 9 5 4^<0x-*<2n-K£+iLU ;*-f>ttffi 
«fcRBus/t-Xh*«7bfc2:it3WSl-r*. 
>«ffiSttx-^F I FO#*afc&*SrCPC IftftS 



So 16X>hU^iJbi77it^HA t 7 7 7 (TL 
B) 9 7 0*4, TLB7FI/XDy-^ 9 7 1^6>A* 

^TLB3>hO-Jl/Dy7^ 9 7 2H RBRif; 
tt I BD^5TLB7 K l/XDvy £ 7 7 £trt 

tTLBK<fct>TfTt>n*fpi**a#!-r*. *r&fcTL 

10 B**H, tfclfc ^*Bftft, 

TLBA^JHI/XWy-XtLTIt IB 
D<hRBR^>?-7x»X (ttt^ll/B) , ^vr 
-^X>h'j;Vy7r9 7 4 

ffl) Stl4TLB7Kl/XP^^rt©l/yx^&i:*« 
&So TLBIt TLB3>hn-;un^y^K-£tVf 

*31^-^#^ttl BD^RBR-\SIOigTo TLB*4 

gfcgie7^-fex$nfc (LRU) ffifta&flfc&a# 

U cnHTLB7F ; l/AU^-/i7tC^:oT«S^a^ 

[0 7 3 2] H^JfoWyftbfc*^ TL B 3 > h D 

-)VU*?yi? 9 7 2(4^-^-y;i/7^^X3>hn 

-jpd$?7^ 9 7 6kpc im#*Mf&-?z>&vmn& 

at. ^-yf-^7KWWl/-^9 7 7*4, ft 

#^*t>tfcPC I7Kl^A**J5)6-r«. PCIB** 
5SB*ftfcr-*tt* ^-yr-^H> h U Ay 7 y 

9 7 4^77f^?n§, s*s*i«<Ka7H^xfcvy 

30 ?#WTLB7Hl/XP?yfr 9 7 7^«6tl, 
ft^-5?T— ^U7^-feX3>hn-jPn5;y^9 7 6 

tt^-^f--^;U7^-fex««^7bfce:4:ft»srr*. 

*tl*6TLB3>hD-JPD$??;^9 7 2»4, TLB 

So 

[0 7 3 3] SC I^<Z>l^X^{I*l£SC ijfc&OW 
^X^fl»HtW*©*lftKIIHJII3n*9 8 0. «#tt 

-f >^-7X-X9 8 1*4, S*PCBu s-f >^-7x 
40 -^eTLBM-e^f-y^^^tUI*^ 
©7^-feXftT3-Hr*. TLB7£irX*4tfc*ajL 
*/BT, r-^*»*fc«6fcTLB3>hD-;PD^ 

&^Pa#pJfi6T, ^v^-^^U-f >^-7x-XK: 
£oT^l^M;:7£-fcX3ft*. Ctl&OAWfctt 

to 7 3 4] 3. l is. i i &a-r>£-7x-; 



( 104 

205 

^j2-\Sfci££n3o PIC 2 3 7H IB* 

#D y £ -f * - > £ 
[0 7 3 5] 2 ) tf x*A*^- K : FT, x 

-^ti^fflfx^O^^i^n-;^ • x-^-f*-^ 10 

[0 7 3 6] 3) -tr>hD-^X : E-K : ClO^-K 
IEEE 1 2 8 4«JpK3£g|SnTV^*«p^n 

[0 7 3 7] P I C 2 3 7tt, >&mzfoVT, ftS&x 

[0 7 3 8] lfl«fcffi*T-*©V-X£LTtt, LM 
C2 3 6 (ABusSfflM) , R02 4 9 (RBus 
£ffiV>£) > *tl*^— j&CBu s^3^W5tl 

5, pic 2 3 7it c:n^^-^y-x^^©h 
-xfrzoh? >*?{7 *s a >a*© y -x $ 

teCBus, ABus> RBus ©y^-Y^UT^ TJ® 
[0 7 3 9] 9*99, *^a.-*HP I C©I*J«W7 

o con > h n-^TTSwrr 

[0 74 0] Efc, CBus>f>^-7x^992 

2 2 4 fe^LTHaf /H^*7^ t 

9 left*., ABu s-f >^-7x- X9 9 1 £CBu s 
-f >^-7x-X9 9 2ttH*tt>;t>f HF I 

FO^£tlTV^ffi*x-^AX9 9 3 A,f-^S8 
ffl*x-^AX^(D7^-feXtt, if©y-XAtfflA 

TV>**fc*fc?x^^*^^ 
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Kfi?H:> h n-?X7-?-f >?-7x-x^Mt 

^^niV^, l^d, fx*A**l»ffl9 9 8»C 
tt, ^V^tlT^^^^f c C^h*x^A*^¥ i fe3> hn 
-K§n^7^^^nil^o fc:x:*A*S!HB«9 
9 8teA*x-^ArL-y h 9 9 9^ffi*£fcBU d 
n».?f^fx*A*»JfBffl5 9 9 8frir>hU-#ZMffl 
SR 9 9 7^C0V^n^(^cfcoT— SIC 1/W h*f-D#ffl 
$TF I FOfct*a*n5f-^t/Hh7< HA* 
F I FO«t§, 

[0 7 4 1] t-?^79 9 6I:H1^(!)*!)>^ 
t^ttT&O, HiAf-^U9 9 3 tAAf-?/U 
9 9 9rt0F I FO©Stttt*«*-*T*fc»KflIV> 

[0 7 4 2] H±o^t*5, 37cHZ3/tf*fl!^££ 

yiffStlTfr £ J PEG3-^24 1£IHV>T 

3^n-feyU-2 4 lttZl*J PEG3-^2 4 l£fflV> 
TJ PEGI>3-ff H-f^-5?«*nW*. ffl*x 
XK*6*tia±© J PEGt3-75/ K*f 

tf£o *Hc, 3^Dty»2 2 4*fflV>5£, n7n* 
irv>^#gl©X tf- F7y ^OHT^*7-f ? 

£tt5©T&£, 

[0 7 4 3] ±Gff9ft»Mf 1 O^Uffi^ 

[0 7 4 4] #«A 

3 ^D-fe ^ iJ- v-f ^ n ^ n y 5 5 >^ 
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1. %fr=^ : f)V 

2. iSMHry h^n-x^ >^ 

3. i^yx^ty h 

4. ftgtSJgjg 10 

A. 1 

Al. 1 r^n-try**©^ HTyyicH-rs-tt* 
3 > h P-;i/^<h n-^JUDM A**£WV©?"^T© 

yU-rtO-tCOffiO^^a-^tt^WlC, BISttTSft 20 
Al. 2 ^^-^©n^^ i^Wya >IS* 

<03>7>f yi/— >3>«B*tt*fcPO f DCC, OO 

B, OOC, MDP, JC, RO, P I C©JQ(*T?<&<5o 
Al. 3 J t<Dl&<D\sVZ>j?0K& 

1. «B6Sft«'C*l/^X^*»o^^a-JH^ 

2. wfcan*^*i/^x^*»oti?3.-->nc» b*- 

\zh\«*JXi?ty h#»*i*tt*©W>X*ttK)fc© 
k^X^cDS^^toofc^lc, *-©*^:x-^©_c 

Al. 4 S£te©fctMft<£;t^7>F©n~-' tV >£f 

i. *^5>Hort«7*-vy han^©^^ 



2. *^5>K©rtBB7*-v-/ h*« r^-©ffi©jgit:j 

^5>F©7*~Vy b£fe&T£o *^5>HBt* 

^y^3>hD-7ffil^tlfc7t-7^ h©:*^5> 
Al. 5 J»6MH*©Xft 

• k^X?€UHe 1 ve t i c a Bo 1 d#T?jto&> 
•k?X^7>f-;i/Hfiregister. field 

•A, BR«C»4afta^ft3ftT^-5^5>l«7- 
KA, **^7>F7~FB, *^7>H7-FCS* 

To 

• A d eskr i p t or, B d e s k r i p t o 

r &&ZfC__d e s k r i p t o r »4Sft«fHfc;£tlT 
V^^©x-^7 - F©xX#'J y^^r^To 

• RI4aft«^{batlTV^**©lB«7-F*3St". 

• " X : Y" »4XtY0*»ft*T. 

@X" »43^n-fey*©l/5?X^##X**T, 
Cbus (X) " t4C/1A*^l/-^a>XO*ff 

'"'Cbus (X) " t4C;«*^W—>3>XfcJ: 

• " * x M ttiHBwtyw&xfts-r. 

• s e t" ttf-^=KaI/-3/a>l/?X^O« 

A. 2 
ft: 

1. ^Ri-'O-FHOxCtOxD 

2. HWcS»4ft±ft7KI/^©/^h ("Tatoft, fi± 

3. 7**.i*\'-*b*toft**9>F&7l'#X'Stl 

5. *^ftHtAAe^-fe^©*KiOjtiiatiTVi*, 
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Pixel organiser fl^-l*?) : 

if i.r=o then 

po_lan <- OzOOOO:X. length 
endif 

if A_descriptor*s=0 then 

po_dmr <- sat. 
endif 

po_aaid <- R 
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po_cfg.mode <- sequential Set going 

po_cfg.dst <- mdp 

-- DCC in n normal32" mode (default) 

--Operand Organiser 8 (^T^y> #±-f1f B) : 

if B_de scrip tor. If 1= other then --not blend 

oot>_len <- po_len 

if B_de script or. S - 0 then 
oob_dmr<- set 

endif 

oob said <- A --special for compositing 
i f B_descriptor.what= tile 

oob_tile < - B 
endif 

oob_cfg. operate <- operate 
endif 

Operand Organiser C > F:*-— if^if C) : 

— There is no implied other data type for this 
--bo used to specify any bit offset in a bit map_attenuation 
if C_descriptor.if !•= other then 
ooc_len po_len 

— set up the ooo_dmr 

if C_descrltor.S « 0 then 

ooo_dmr <- aet including bo 

endif 

ooc_said <- C 

ooc_cfg . operate <- operate 

endif 

— Main Data Path: 

if B_descrlptor.if = other then — blend 
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mdp_len <- pollen 
mdp_bm <- B 
mdp_bi < - A 
endif 

mdp_cfg. instruction <- I. opcode 

mdp_cf g.blendgen <- (B_deBcr±ptor. if - other) 

m <iP— of g. decode <- 1 

mdp_cfg. operate <- 1 

Result Organiser (jg^^t— fJl^f+f) ; 
if R_descritpor.S = 0 then 

ro_dmr <- set 
endif 
ro_sa <- R 

ro_cfg.mode <- sequential 



-- Pixel Organiser: 

— Operand A only makes sense as source pixels, anything else 

— is probably wrong 
if i.R « o then 

po_len <- OxOOOO 1 1, len 
endif 

If A^jOeacxiptor . S « 0 then 

po_<tar <- set 
endif 



a. 3 earn** 

ft: 
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po_sald < - A 

po_muv <-c I £ F tables 

po_cfg-mode <- CSC 
po_cfg.dst <- mdp 
-- DCC setup: 
-- should be other 
if B_.descriptor.if = other then 
dcc_ addr <- B 

dcc_cfg2.cache_mi83__inst <- B_ descriptor .C 
if I.M= 0 then --single output channel 

dcc_cf g2.tnode <- single channel CSC 
else 

dcc__cf gZ*mode <- multi channel CSC 
endif 
endif 

— Operand Organiser B: 

Operand B should he other, anything else is probably 
-- wrong but do it anyway 
if B_descrlptor . if 1- other then 
oob_J.en <- po_len 
if B_descriptor.S= 0 then 

oob_jdmr <- sot 
endif 

oob_said <- B 
oob_cfg. operate <- operate 
endif 

— Operand Organiser C: 

« Operand C shonld be other, anything else Lb probably 



217 

wrong but do it anyway 
if C_descriptor_if 1= other then 
ooc_len <- po_len 
if C_descriptor. S= 0 then 

ooc_dmr <- set 
endif 

ooc_said <- C 

ooc_cfg. operate <- operate 
end if 

Main Data Path: 
mdp_cfg. instruction <- I. opcode 
m<lp_cfg. decode <- 1 
mdp_cfg. operate <- 1 
— Result Organiser: 
if R_ descriptor . S = 0 then 

ro_dmr <- set 
endif 

ro_sa <- R 

ro_cfg.mode <- sequential 
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A. 4 J PEG^ 
ft: 

1. rt^n-Htao x2T*So 
A. 4. 1 #fi 

ft: 1. ttT©l^^^H#**fflWttt«anTV^ 

•ro_idr: WAlii«^7C*^^X^ 

• r o__c u t : mtiti y V V¥7s9 

• ro_lmt : ttitlMmU>?Z>i? 
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-- if there is a register set targeted at anything other than 
-- the JC. PO. SO or DCC 

if (D.R= 1) and (C_d©ccriptor . topnibble 1=0x5 , Gx6.0b9 or OxS) then 

ec_descrlptor <- C 
end if 

-- Pixel Organiser: 

Operand A in the eon rot* aonpreesed data, and ahould be a 

— byte St re an 
If I.R = 0 then 

pollen. <-OxOOOO:t.len 
endiC 

if ^_de ocript or . S - 0 then 

po_dmr <- set 
endlf 

PQ_said <- A 

-- if there is a regie ter set targeted at PO 
if [D,R - 1) and (C_desoriptor.topnibble ° 0x6) then 
$C_desoriptor <- C 
endif 

po_cf g.mode <- sequential 
po_cfg.dat <- jc 

— DCC satupi 

i-£ B_d**cr iptor . if - other then 
dcc_addr <- B 
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if there is a register set targeted at DCC 
if (D.R « X) and ( C_descriptor . topnibble = 0x5) then 

@C_descriptor <- C 
endif 

dcc_cfg2,C < -B_de script or. C 
dcc_cf g2 . mode <JPEG decode 
end if 

operand organiser B: 

Operand B should be other , anything else is probably 
-- wrong, but do it anyway 
if B__descriptor*if != other then 
oob_len <- po_len 
if B„de scrip tor. S=* 0 then 

oob_dmr <- set 
endif 

oob_said <- B 
oob_cfg. operate <- operate 
endif 

--Operand Organiser C: 

-- Operand c may be a register to. set. If it isn't it should 
--be other, anything else is probably wrong, but do it anyway 
if (R.D= 0) and (C_.descriptor.li: 1= other) then 

oocJLen <- po__len 

if C__de8Cxlptor.S"s 0 then 
ooc_dmr <- set 

endif 

ooc_said <- C 
ooc_cfg. operate <- operate 
endif 
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JC setup: 

-- if there is a register set targeted at JC 

if (D.R - 1) and (C_de scrip tor. topnibble-0x9) then 

@C_aescriptor <- C 
endif 

jc_cf g . instruction < - I . opcode 

jc_cfg. decode <- 1 

jc_cfg. operate <- 1 

-- Result Organiser: 

if ^descriptor. S = 0 then 

ro_dmr <- set 
endif 

if there is a register write targeted at RO 
ro_sa < - R 

if (D.R = 1) and (C_descriptor. topnibble = OxD) then 

@c_descriptor <- c 
endif 

ro_cfg.mode <- J peg 
ro_ofg.chan <- I. Ms 1 . 4 

ro_cfg.upsainple <- I.S 
ro_ofg.cut <- I.C 
ro_cfg. limit <- I.T 



1. lXT0Uy7>?\*ft$tmftmzmfe2tlT^Z>&m • r o_c u t : tB**^hl/^ 



A. 4. 2 JEffi 
ft: 



• j c_rml iHX^-hV-^tfH >?-/W 



• r o_lmt : tti*MRI/5?X* 
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--if there is a register set targeted at anything other than 

the JC, PO, ,RO or DCC 
if (D.R=l)and {C_deecriptor. topnibble != 0x5,0x6,0x9 or OxB) then 

<?C_descriptor <- C 
endif 

-- Pixel Organiser: 

— Operand A is the source compressed data, and should be a 
-- byte stream 

if I.R - 0 then 

po__len <- 0x0000 I. len 
endif 

if A_descriptor . S = 0 then 

po_dmr <- set 
endif 

po__said <- A 

if there is a register set targeted at PO 
if (D.R = 1) and ( C_descr iptor . topnibble - 0x6) then 

@C_descriptor <- C 
endif 

po_cfg.mode <- jpeg 
po„cfg.dst <- jc 

po_cfg.SS <- I.S — subsampllng 

po_cf g . chan <- I.M:I4 
po_cfg.F <- I,F 

— DCC setup: 

if B_descriptor,if = Other then 

dcc_addr <- B 
--if there is a register set targeted at DCC 

if (D.R =1) and (C_descriptor . topnibble = 0x5) then 
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@C_de scrip tor <- C 
end if 

dcc_cf g2 . cache_miss_inet<- B_de script or. C 
dcc_cf g2,mode <- JPEG encode 
end if 

-- Operand Organiser B: 

-- Operand B should be other .anything else is probably 

-- wrong, but do it anyway 

if B„de scrip tor. if I- other then 

oob__len <- po_len 

if B_descriptor,S = 0 then 
oob_dnir <- set 

endif 

oob_saia <- B 
oob_cfg, operate <- operate 
endif 

-- Operand Organiser C; 

Operand C may be a register to set. If it isn't it should 
--be other, anything else is probably Wrong, but do it 

— anyway 

if (R.D = 0) and (C_descriptor.i£ I- other) then 
ooc_JLen <- po_len 
if CLdescriptor,S= 0 then 

ooc_dmr <- set 
endif 

ooc_said <- C 
ooc_cfg. operate <- operate 
endif 

— JPEG Coder: 
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--if there is a register set targeted at JC 

if (D.R*l) and ( C_de scrip tor > topnibble = 0x9) then 

@C_descriptor <- C 
endif 

jc__cfg. instruction <- I. opcode 
jc_cfg. decode <- 1 
jc_cfg. operate <- 1 
Result organiser: 
if R_deecriptor.S = 0 then 

ro_dmr <- set 
endif 

ro_sa < - R 

— if there is a register write targeted at RO 
if (D.R=1) and (C_descriptor . topnibble = OxB) then 
§C_descriptor <- C 

endif 

ro_cfg.mode <- sequential 
ro_cfg.cut <- I.C 
ro__cfg. limit <- I.T 



1. t^TOf-^a-f^ >{f#ftttBMi* m&mm -packbi t s«*t{b (A*->?3 >B) 



A. 5 >if 
ft: 



30 



•packbi t s ft*f<fc (/t-5?a >A) 



3. »imuT^-T4>>fmft 



4. *^7>\ i C\mfe-?Z>tzlt><Dlsi?A?'V i b&\tK 
6. &T<0l^**S#^l^ 

• r o_c u t : y V VV7,$ 
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j_f(D.R - 1) and ( C_de script ox . topnibblo J - 0x5,0x6 f 0x9or0xB)thtsu 
@C_descriptox <- C 

Midi* 

Pixel Organiser: 

— Operand A is tho source data 
if I.R - 0 than 

po_len <- 0x0000: I. Ion 
endlx 

±£ /v_de script or .S = o then 

po_dmr <- sot 
end If 

po_said <- A 

— if there is a register set targeted at PO 

i£ (I>,R - 1) and < C_d6ecriptor . topnibble - 0x6} then 

6C_dficcxiptor <- C 
endif 

po_cig.i»ods <- sequential 
po_cfg.dat <- jc 

DCC satupt 
1£ BUdeacrXptor.it - other then 
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dcc_addr < - B 
if there is a register set targeted at DCC 

if (D.R = 1) and (C_descriptor . topnibbte = 0x5} then 

@C_de scrip tor < - C 
endif 

dcc_cfg2.cache_jniss_inst <- B_de script or. C 
dcc_cf g2 .mode <- en/de coding 
else 

-- if there is a register set targeted at DCC 
if (D.R = 1) and ( ^descriptor, topnibble = 0x5) then 

@C_aescriptor <- C 
endif 

dcc_cfg2.caahe_miss_inst < - B_descriptor.C 
endif 

Operand Organiser B: 

Operand B Should be other , anything else is probably 
-- wrong, but do it anyway 
if B_descriptor.if != other then 
oob_len <- po_len 
if B_.descxlptor.S- 0 then 

oob_drar <- set 
endif 

oob_said <- B 
oob_cfg. operate <- operate 
endif 

— Operand Organiser C: 

— Operand C may be a register to set. If It Isn't it should 

— be other, anything else is probably wrong, but do it 

— anyway 
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if (R.D = 0) and (C_descriptor. if i= other) then 

ooc_len <- po_len 

if C_descriptor_S= 0 then 

ooc_dmr <- set 

endif 

ooc_said < - C 
ooc_cfg. operate <- operate 
endif 

— JPEG Coder: 

--if there is a register set targeted at JC 

if (D.R = 1) and (C_descriptor. topnibble - 0x9) then 

@C_descriptor <- C 
endif 

jc_cfg. instruction < - I. opcode 

jc_cfg. decode <- 1 

jc_cfg . operate <- 1 

-- Result Organiser: 

if R_descriptor .S = 0 then 

ro_dmr <- set 
endif 

ro_ea <- R 

— if there is a register write targeted at RO 

if (D.R = 1) and ( C_descriptor . topnibble « OxB) then 

ec_descriptor <- C 
endif 

ro_cfg.mode <- sequential 
ro„cf g . cut < - I . C 
ro_cfg. limit <- l.T 

a. 6 &&ta&&& -*;uo*€rs mm fc©K*fbT, 

1. #<3-Kte0x4 £0x5 \tXry7V4XifilV-Z\Z&±)Vt1ir>T\f*Z>Z.t 



( 120 ) «fB8¥l 1 - 8 5 9 6 9 

237 238 

-- Pixel Organiser: 

Opov&nd. & is the descriptor and the PO da livers 

-- Kernel co-eff iciants to the HDP. This is ceded as 
-- -Other" 

L»l long format 
0 short format 
if I.R =» 0 then 



endif 

if A_descriptor.S = 0 than 

po_dmr <- set 
endif 

po_said <- A 

po_cf g.mode <- convolution/ transformation 
po_cfg.dst <- map 
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DCC setup: 

the Implied data type here is the source image 
if B_descriptor.if = other then 
dcc_cf g2.cache_miss_inst <- B_de script or. C 
if I.S = 0 then 

dcc_cfg2.rnode<- 64 bit mode 
else 

dcc-cfg2.mode <- random ,mode 
endif 
else 

dcc_cf g2.cache_mise_inst <- B_de script or. C 
endif 

— Operand Organiser B: 

-- Operand B is a pointer to source image (other), anything 
else is probably wrong, but do it anyway 
if B_descriptor.lf != other then 

obb_len <- po_len 

if B_descriptor.S = 0 then 
oob_dmr <- set 

endif 

oob said <- B 
oob_ctg. operate <- operate 
endif 

operand organiser d 

— Operand C descriptor is borrowed by the r. c field 

— bo 00C is not set up 
-- Main Data Path: 

Kernel descriptor is delivered to MOP via PO 
mdp_JLen <- po_len 
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mdp_bm . rows < 



C_de s cr ipt or . r 



mdp_bm . cols < 



C_de s c rapt or . c 



mdp_cf g . instruction < - I . opcode 
mdp_cfg.long_krnl <- A_descriptor.L 
mdp_cfg.de code <- 1 
mdp_cfg. operate <- 1 
-- Result Organiser: 

-- result will be pixels or part thereof 
if R_de scrip tor . S= 0 then 

ro_dmr <- set 
endif 

ro_sa <- R 

rc_,cf g . mode <- sequential 



--if there la a register set targeted at anything other than 

the HDP, PO, RO or DOC 
if(D.R-l) and(C_dB8criptor,topnibble I- 0x5,Ox6,0xA or OxB) then 

QC_G&$ cxlpt or <- C 
endif 

— Pixel Organiser i 

Operand A is the source pixels, only makes sense to have 

— whole pixels, anything else is probably wrong 
if I.R « 0 then 



a. 7 nmrnn 
a: 

1. t^-HflOx 3 

2. :t7^>3> : 
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po_len <- 0x0000: I. len 
endif 

if A-descriptor.S = 0 then 

po_dmr <- set 
end if 

po_said <- A 

-- if there is a register Set targeted at PO 

if (D.R=l)and ( C_de script or. topnibble ~ 0x6) then 

@C_descriptor <- C 
endif 

po_cfg.mode <- sequential 
po_cfg.dst <- radp 
DCC setup: 

— the implied other data type is a matrix of coefficients 
if B_descriptor_i£ - other then 

dcc_addr <- B 
endif 

— if there la a register write targeted at DCC 

if (D.R - 1) and (C_desoriptor. topnibble - 0x5) then 

GC_descriptor <- C 
endif 

if B_descriptor.if = other 

aoc_cfg2.caohe_piiss_inst <- B_de script or, C 

dcc_cf g2 *mode <- matrix multiply 
else 

dcc_cfg2.cache_jniss_inst <- B_descr±ptor.C 
endif 

— Operand Organiser B: 

— Operand B Is the matrix co-efficients (other). 
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anything else is probably wrong, but do it anyway 
if B_descriptor * if ! = other then 
oob_len <- po_len 
if B_descriptox . S = 0 then 

oob„dmr <- set 
endif 

oob_said<- B 

oob_cfg. operate <- operate 
endif 

— Operand Organiser C: 

Operand C may e a register to Set. If it isn't it should 
--be other, anything else is probably wrong but do it anyway 
if (R.D = 0) and (C_descriptor.if !- other} then 
ooc_len <- po_len 
if C_descriptor- S= 0 then 

ooc_dmr <- set 
endif 

ooc_said <- C 
ooc_ofg. operate <- operate 
endif 

— MDP setup: 

— if there is a register Set targeted at MDP 

if (D.R » 1) and {C_ desoriptor.ponibble = OxA) then 

ec„descriptor <- C 
endif 

cndp_cfg. instruction < - I, opcode 
mdp_cfg. decode <- 1 
mdp_cfg. operate <- 1 
Result Organiser: 
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if R_descriptor.S = 0 then 

ro_dmr <- Set 

endif 

ro_sa <- R 

if there is a register write targeted at RO 
if (D.R = 1) and (C_descriptor.topnibble = OxB) then 

@C_descriptor <- C 
endif 

ro_cfg.mode <- sequential 

A. 8 1 — >W}>W 

a: 

1. rfO-KteO x 7 

2. ^-y^3>W\-y) — XDWVUOfr 

3. A-7 1 — >Xt7V->&m®\Z*yi/^$>Z>W$ 
T>*y v^£tlTV>£^9> bf^*fe;^^VM^A-r 20 

— Pixel Organiser : 

-- A operand is the source pixels or bytes 
--PO operates In conventional sequential mode 
if I.R - 0 then 

pollen <- OxO0O0:I.len 
endif 

if ^descriptor. 3 = 0 then 

po_dmr <- set 
endif 

po_sald <- A 

po_cfg.i*ode <- sequential 
po__cf 3. dst <- ntdp 

— DCC setup? 

— no implied other data type 
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if there is a register write targeted at DCC 
if (D-R = 1) and (C_dcscriptor. top nibble - 0x5) then 

@C_j5sscriptor <- C 
endif 

if B_deecriptor .C - 0 then 

dcc_cf g2 . cache_miss — inst < - B_descriptor . C 
endif 

-- Operanc organiser B : 

-- Operand B is the halftone screen, and should be a tile 
« anything else is probably wrong, There is no sensible 

— implied •other" data type for half toning* 
if B_de scrip tor. if 1= other then 

obb_len <- po_len 

if B_de scrip tor . S= 0 then 

oob_dmr <- set 
endif 

oob__said <- B 

if B_descrlptor ,vhat= tile 

oob_tile <- C 
endif 

oob_cfg. operate <- operate 
endif 

— Operand Organise C; 

-- Operand C word Is borrowed by the tiled Operand B. 

— The C_deecxiptor is completely ignored 

OOC is In random mode (default) and wont be touched 

Main Data Path: 
mdp_bm. level <- I. levels 
mdp_cfg. instruction <- 1. opcode 
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mdp_cfg.aecoae <- l 

mdp_cfg. operate <- 1 

~ Result Organiser: 

if R_descriptor«S = 0 then 

ro_dinr <- set 
endif 

ro_sa <- R 

ro„cfg.mode <- sequential 

A. 9 ^u-ntr- 

1. *^-Htt0x92. ^©A^tt^tU-nfc!'- 
po._cfg.dst <- Jc 
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• ^U7x7il/DMAf^P I C 4: LM Claret 

A. 9. 1 
-- Pixel Organiser*. 

A Operand is the so urea data 

-- PO operat«s in conventional sequential mode 

If I.R - 0 then 

po_len <- 0x0000*1 . len 

eadlf 

If A_deBcrlptor. 3 - 0 than 

po_dnrr <- B 
end if 

po_oaid <- A 

po_c£g.mode <- sequential 



— Operand Organiser B: 

— Operand B word represents a value to put in po_dmr 

— Operand Organiser C: 

— Operand C word represents the value to put in the ro_dmr 
JPEG Coder: 

jc_cfg. instruction <- I . opcode 

3c_cfg.lbo <- D.bo 

jc__cfg.obo <- I.obo 

jc_cfg, decode <- l 

jc_cfg. operate <- 1 

-* Result Organiser: 

if R_de scrip tor. S = 0 then 

ro_dmr <- C 
end 

ro_sa <- R 

ro_cfg.mode <- sequential 



a. 9. 2 ^U7x5)^bkiAtea 
i t^l 



3. P I Clif-i'^^t^a-MOT, ZL0fir$ 

tis*o" mm" tSM^M&s. 
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--only setup required is for the PIC: 
if I.S = 1 then —data in from PIC 

pic_abo£_addr <- R 

els* 

pic_abus_addr <- A 

endif 

— if there is a register write targeted anywhere 
if D.R = 1 then 

i2c_deiicr-L.ptor <-■ C 
endif 

if LR = 0 then 

pic_abue_cf g.ab_count <- 0x00: 1. length 
endlf 

pic_abus_cfg.ab_byte_en <- I « byte 
pic_abus_cf g*ab_type <- I.R 
pic_abus„c£g. start <- 1 

Pixel Organiser: 

Operand A is the source address given to the PIC 
-- The PO is not setup 

— Operand Organiser B: 

— OOB is not setup 
--Operand Organiser C 

— OOC is not setup 

— Result Organisers 

— The Result wort represents the target address 

— RO is not set up 

A. 10 7*hCD#fi A. 10. 1 *¥*ifH 
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— if there is a register set targeted at anything other than 
-- the MDP,PO or RO 

if (D.R = 1) and ( C_de script or .topnibble != 0x6, OxA or OxB) then 

(ac_descriptor <- C 
endif 

-- Pixel Organiser: 
if I.R = 0 then 

po_len < - OxOOOO : I . lea 
endif 

if ^descriptor. S ~ 0 then 

po_drnr <- eat 
endif 

po_said <- A 

— if there is a register set targeted at PO 

if (D.R = 1) and (C_de scrip tor .topnibble - 0x6) then 

§C_ descriptor <- C 
endif 

po_cf g.moda <- sequential 
po_cfg.dst <- mdp 

Operand Organiser B: 

— Operand a word is completely ignored 

— The B_de scrip tor is completely ignored 
-- Operand Organiser Cs 

— Operand C may be a register to Set up 

— Main Data Path: 

— if there is a register set targeted at MDP 

if (D.R - 1) and (C_descriptor. topnibble = OxA) then 
ec_desoxiptor <- C 
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endif 

mdp_cfg. instruction <- I -opcode 

if A_descriptor. if /--- packed by - :. th^n-- tell map about, siz: 

radp_cf g. word_input <- 1 
else 

mdp_cf g. word_JLnput <- 0 
endif 

mdp_cfg. decode <- 1 
mdp_cfg. operate <- 1 
-- Result Organiser: 
if Rde scrip tor . S = 0 then 

ro_dmr <- set 
endif 

ro_ sa <- R 

ro_cfg.mode <- sequential 



--If there Is a register set targeted at anything other than 
-- the HDP, PO, OOB ox RO 

if (D.K=l)and(C-deacriptor.topnibble I- 0x6,0x7 ,0xA or OxB) then 

fcC_dea crip tor <- C 
endif 

-- Pixel Organiser * 
if I.R « 0 then 
po_lan <- 0x0000 s I. len 



10.2 mjtMmtmme 



2. vpwtn-i hommzttLrmfifim. 
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endif 

if ^descriptor ,S = 0 then 

po_dmr <- set 
endif 

po__said <- A 

if there is a register set targeted at PO 
if (D.R - 1) and (C_descriptor. topnibble - 0x6) then 

@C_descriptor <- C 
endif 

po_cfg.mode <- sequential 
po_cfg.dst <- radp 

— DCC setups 

— no implied other data type 

--if there is a register write targeted at DCC 

if (D.R = 1) and (C_descriptor. topnibble = 0x5) then 

@C_des crip tor < . - C 
endif 

if B_de scrip tor. c = 0 then 

dcc_cf g2.oaohe _jniss_inst <- B_descriptor.C 
endif 

Operand Organiser B: 

— Operand B is the residual a or one operand for 

— interpolation, anything else Is probably wrong, but 

— do it anyway 

-- There is no sensible implied "other" data type for 
-- this. 

if B_de scrip tor. if t - other then 
oobJ.en <- po_len 
if B_descriptor . S = 0 then 
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oob_dmr <- set 
endif 

oob_said <- B 
endtf 

-- if there is a register set targeted at OOB 
if (D.R = 1) and (C_descriptor.topnibble = 0x6) then 
@C_descriptor <- C 



oob_cfg. operate <- operate 
endif 

Operand Organiser C: 

— may be a register to set 

— Hain Data Path: 

— if there is a register set targeted at MDP 

if (D.R = i) and (C_descriptor. topnibble « OxA) then 

@C_descriptor <- C 
endif 

mdp_cfg„ instruction <- I. opcode 

if A_descriptor.if /= packed_bytes then — tell mdp about size 
mdp_jcf g.word^input <- 1 

else 

mdp_cf g . word_input <- 0 
endif 

mdp_efg. decode <- 1 
mdp_cfg„ operate <- 1 

— Result Organiser x 

if R_descriptor.S - 0 then 

ro_dmr <- set 
endif 



endif 



ro_sa <- R 



ro_cf g . mode <- 0 



— sequential 
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1. *^3-HH0 xB 

3. n7 o n-tr^ij-^T^0^^^|^g^Tt)tiT, Sfcd 



only thing to do 1st 
if D.R = 1 then 

PC_<le&c r.iptor <- C 
endif 

-- Pixel Organiser: 

no setup associated with operand A 

— Operand Organiser B: 

— no Betup associated with Operand B 
Operand Organiser C: 

--no Betup associated with Operand C 

Result Organ! eer; 
-- no setup associated with the Result 

A. 1 1. 2 jH9iT9±7, (U-H) 

a: 

1. 

3. RO&mmz3?U*yVftT*^X&fi5m—<D 

Pixel Organiser! 

— no setup associated with Operand A 
-- Operand Organiser Bi 
--no setup associated with Operand B 

— Operand Organiser Cs 
--no setup associated with Operand C 
-- Result Orgaalser 
ro_sabst <- Cbas(A) 
ro_s« <- B 

ro_cf g.nodfi <- Cbufl.dnts 

A. 1 1. 3 rt«7£-fe* i3< Y) 
ft: 
1. 

2. ^h^lix-^*3yn*y*rtfcliai-«. 

3. £0<MM4" m®" ft^TflifeV^OT, I C w 



- 3 111 J ' V : *• >,Vfi»-fi^ 
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Cbus(A) <- B— Pixel Organiser: 

— no setup associated with Operand A 

— Operand Organiser B: 
no setup associated with Operand B 
Operand Organiser Ci 

— no setup associated with operand C 
Result Organiser: 

-- no setup associated with the Result 

a. 1 2 ^mznrcfrf? 
a : 

1. OxFttfftSftTV^. 

2. Tiesnfc^ttvx^^iiB&x^-stbr. 

3. cft6©^»anfc»*tt3^Dty^^a*BT 

1. i upxp 7)1? 

SWiWJIH >^-7x-X©b^>1f^>3 >£J8V> 

2. *<©l/5?X^tttl^lfr©WfP«KJ:Drt«3We 
30 fritV^aaSftWPItt, «*&W?**fc8ft©tt« 

SfU RJtSn*. V><Oj&HDl^X*Ttt*M**fr© 

*6»«i/^x^*tt*©«rta»© r^*tjj ©£#te 
40 «*ai$n-&o 

1. 1. lU&X3r<D#m 
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/Wiry ht-K^LTPC I /IX A* S 
-try htS&Vi, 
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<£&tt*nJ;D;*:€r&) ^>7^al/- 

a>7jy2i/i?Zsi?hftm^m&m^m^, was 
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7 
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prectiargs_ 
Daniel 


yU5V-^<n>#7^Xt^h <n+5 

) 
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[16] 
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ras_bi 


RASCOS^hift^ny^-^-r^ (ras© 
^U-^-^) (IJbm 4, 0=1) 


[10 : 9] 




£ D ZMZim* HAS CAS 'S 

oMa (2^6, 0=2) 


[8:7] 




£ n £ sdram 

CAS latency (JglE) t 

2= 2 £ P y 2? 
3—3 ^Oy^ 


[6:0] 




reserved 



HIO x 0 0 0 7 f f 8 0 &##-T£. 
b f . lmi jnode 





m 




[31 :i4] 




Reserved 


[13] 


IxHtiliie i 




[12:0] 







sai= It s b & am*- t* fcst&*ftste« 



fe^*UliM;H©U-l:*>t«feitttes*sh*. c 




( 179 ) 1-8 
355 356 

7A-r^fee6ic, aiBfcey Mi-fey hana. ash >?-7x-xi/^^ 

l/^X^©«*a*Hffft» £©fc:y Maa»»»;H?n bg. pic„cfg 



^1.60 plc_ofg l^X^7^— 









0 
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reearved 
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21 


revere O. 
avert data 




22 


fcn*fart|_ l 

OVBTt^jCDnt 
EOl 




23 


reveree_JL 

OVGH't^ OO 

Qtrol 




30:24 




*Jtfmtt&-** h 'J-AcBus IM"? 




31 


cbas_fci«ar 
^Usable 





bh. pi c__s tat 



( 181 ) »BB¥l 1 -8 5 9 6 9 

359 360 
pic,stat ;PK 



(£>;/ w 






o 


nfaiifz art 


ABua (Dh^ ISrffy 1/ 3 >tV >y 


1 




iujuo i ^ ,j s ^ * ^ s " ^ y 


*> 


VQTS Q_3jCt 


p 


3 
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IF (AREQ) THEN 

IF (BREQ & (A>B) * CREQ & (A>C) + OREO & (A>01) THEN 
GRANT A 
IF (BREQ) THEN 

(F (AREQ & l<A>B) + CREQ & (B>C) + OREQ & (B*Ol) THEN 
GRANT B 
IF (CREQ) THEN 

tf (AREQ & l(A>C) + BREQ & !(B»C) + OREQ A (C>0)l THEN 
GRANT C 
IF {OREQ) THEN 

IF {AREQ & !(A>D) «• BREQ & !{B>0) + CREQ & l{C>D» THEN 
GRANT D 

IF (GRANT A) THEN { 

IF ((AU=BU) & (AL>BU> THEN 
BL++ 

IF ((AU=CU) & (AL>C1_)) THEN 

IF <(AU=OU) A (AL>OL» THEN 

OL++ 
AL = 0 

> 

ELSEIF (GRANT B) THEN { 

!F ((AU==BU) & i(AL>BL)) THEN 
AL++ 

IF «BU==CU) & (BL>CL)) THEN 
CL++ 

IF ((BU=DU) & <BL>OL)) THEN 

DL++ 
BL = 0 

} 

ELSEIF (GRANT C> THEN ( 

IF <(AU==CU) ft !(AL>CL)) THEN 
AL++ 

IF ((BU=CU) ft !(BL>CU) THEN 
BL++ 

IF ((CU==OU) & (CL>DL)) THEN 

DL-w- 
O_ = 0 

} 

ELSEIF (GRANT O) THEN { 

IF ((AU=*=DU) & KAL>DL)) THEN 
AL*+ 

IF «BU*=OU» & !<BL>DL)) THEN 

IF «CU==DU) & !(CL»OL)J THEN 

CL++ 
DL =0 

I 
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1. TITLE OF THE INVENTION 

5 IMAGE PROCESSING APPARATUS AND METHOD THEREFOR 

2. CLAIMS 

1. A method of rendering image data prior to output of the resulting 
image, said method comprising the steps of: 

10 (1) utilising a graphics co-processor in conjunction with a host 

processor, 

(2) within the graphics co-processor providing a plurality of data 
calculation streams arranged in parallel fashion and 

(3) operating only one of said data calculation streams at any one time 
15 to perform like image calculations on a stream of like image data. 

2. A method as claimed in claim 1 including the step of: 

(4) providing at least one of said data calculation streams with a 
reconf igurable calculation unit, and 

20 (5) at the conclusion of step (3) reconf iguring said calculation uni t. 

3. A method as claimed in claim 2 wherein step (3) is repeated after 
step (5). 

25 4. A method as claimed in claim L including the step of normalising 

said image data prior to step (3). 

5. A method as claimed in claim I including the step of providing two 
data calculation streams. 

30 

6. A method as claimed in claim 1 and Including the step of providing 
three data calculation streams. 

7. A method as el aimed in claim 1 and including the step of output ting 
35 said resultant image by printing the image on a printer. 

8. Apparatus for rendering image data prior to output ting the 
resulting image, said apparatus comprising a graphics co-processor adapted to 
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operation in conjunct ion with a host processor, said graphics co-processor having 
a plurality of data calculation streams arranged in parallel fashion but only one 
of which is operated at any one time to perform like image-calculations on a stream 
of 1 ike image data. 

5 

9. Apparatus as claiaed in claim 8 wherein at least one of said data 
calculation streams has a reconf igurable calculation unit. 

10. Apparatus as claimed in claim 9 wherein said calculation unit is 
io ^configurable during operation of another of said data calculation streams. 

11. Apparatus as claimed in claim 8 wherein Ihe number of said data 
streams is two. 

15 12. Apparatus as claiaed in claim 8 wherein Ihe number of said data 

streams is three. 

13. Apparatus as claimed in claim 8 and including data normalisation 
means upstream of said parallel data calculation streams. 

20 

14. Apparatus as claimed in claim 8 including a printer onto which said 
resulting image is output. 

15. A method of controlling the interaction between a host CPU and at 
25 leasl one co-processor in a computer sys tea to permit substantially simultaneous 

decoupled execution of CPU instructions and co-processor instructions, and dynamic 
allocation of commonly used memory space during the course of the execution of said 
instructions, said method comprising the steps of: 

(a) said host CPU allocating memory resources to be utilized by a set 
30 of instructions to be co-processor executed; 

(b) generating a queue ol pending co-processor instructions to be 
executed and a clean up queue of co-processor instructions for which execution has 
been completed; 

(c) from time to time, under control of said host CPU releasing for 
35 re-al location memory resources previously utilized by the Instructions contained 

in said clean up queue of executed instructions. 
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17. A meihod as claimed in claim 16 wherein said predetermined 
instruction is one vliicti utilim very substantial memory- resources. 

18. A method as claimed in claim 16 wherein said predetermined 
instruction is the last instruction in said pending instruction queue. 

19. A method as claimed in clain 15 vherein step (c) is carried out when 
said host CPU detects that currently free nemory resources are running low or are 
exhausted. 

20. A method as claimed in claim 19 wherein further processing by said 
host CPU is interrupted and step (c) is carried out after a predetermined fraction 
of said queue of pending co-processor instructions have been executed. 

21. A method as claimed in claim 20 wherein said predetermined fraction 
is selected front the group consisting of one third, one half and two thirds. 

22. Amelhod as clained in clain 20 wherein step (c) is carried out after 
a predetermined number of the pending instructions in said queue of co-processor 
instructions have been executed. 

23. A method as c tainted in claim 15 wherein step (a) allocates 
instructions for a single co-processor. 

24 Dynamic memory management Deans in a computer system baring a memory 
of predetermined size, a host CPU and at least one co-processor, said memory 
management means comprising: 

(a) an instruction generator means connected with said host CPU and 
generating a sequence of instructions intended for co-processor execution, 

(b) a memory manager leans connected to said nemory and said instruction 
generator means to dynamically allocate space in said memory for co-processor use 
in executing said sequence of co-processor instructions, 

(c) a queue manager meant connected to said instruction generator means, 
said memory manager neans and said co-processor, said queue manager means being 
arranged to store said sentience, of instructions in a queue of pending instructions 
to be co-processor executed and a clean up queue of instructions which have been 
co-processor executed, 

wherein from time to time said queue manager means removes executed 
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instructions from said clean up queue lo thereby release for reallocation memory 
space previously allocated to said removed executed instructions. 

25. The dynamic memory management means as claimed in claim 24 wherein 
5 if said memory manager means is unable to satisfy a request for memory space, said 

Queue manager means is triggered to remove said executed instructions from said 
clean up queue. 

26. The dynamic memory management means as claimed in claim 25 wherein 
10 if after removal of said executed instructions Iron said clean up queue said memory 

manager means is still unable to satisfy a request for memory space, further 
processing by said CPU is interrupted until a predetermined fraction of saidaueue 
of pending co-processor instructions has been co-processor executed. 

15 27. The dynamic memory management means as claimed in claim 26 wherein 

said pre-determined fraction is selected from the group consisting of one third, 
one half, and two thirds. 

28. The dynamic memory management means as claimed in claim 26 wherein 
20 further processing by said CPU is interrupted oati 1 a predetermined number of said 

instructions in said queue of pending co-processor instructions have been co- 
processor executed. 

29. The dynamic memory management means as claimed in claim 28 wherein 
25 said predetermined number corresponds to the entire length of said queue of pending 

co-processor instructions. 

30. A co-processor adapted for using virtual memory with a host 
processing means, a memory coupled to said host processing means to implement said 

30 virtual memory, said co-processor including: 

vir tua 1-phys icat memory mapping means for interrogating a virtual memory 
tabic and for napping one or more virtual memory addresses requested by said 
co processor into corresponding physical addresses in said memory of said host 
processing means. 

3S 

31 The co-processor according to claim 30, wherein said virlual memory 
tabic is stored in two or more non-con tiguously addressable regions of said memory. 

32. the to-processor according to claim 30, wherein said virtual- 
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physical memory mapping means further includes: 

a nui tiplc-cntry translation lookaside buffer for caching virtual-to- 
physical address mappings. 

33. The co-processor according to claim 32. said virtual-physical 
memory capping thjans further includes: 

neans for comparing, replacing, singly invalidating and multiply 
invalidating one or more entries of said translation lookaside bufier. 

34. The co-processor according to claim 32, wherein said translation 
lookaside buffer is adapted to replace entries therein on a least recently used 
replacement basis. 

35. The co-processor according to claim 32, Therein said virtual- 
physical memory mapping means includes: 

neans for upon an occurrence of amiss in said translation lookaside buffer, 
hashing a virtual memory address using a hash function to produce an index intn 
said virtual memory table of said host processing neans. 

36. The co-processor according to any one of claims 30, 31 and 35, 
wherein said virtual memory table is a page table. 

37. A method uf operating a co-processor to use virtual memory with a 
host processing means, a memory coupled to said host processing means to implement 
said virtual memory, said method including the steps of: 

interrogating a virtual memory table; and 

napping one ih mure virtual memory addresses requested by said co-processor 
Into corresponding physical addresses In said memory of said host processing means. 

. . 38. „ The method according to ; cJaim.3,7 fk therein saiji ▼irtna.I memory table 
is stored in two or more non-contiguous I y addressable i eg ions of said lienor v. 

39. The method according to claim 37, further Including the step of: 
caching virtual-to-phystcal address mappings in a multiple-entry 

translation lookaside buffer. 

40. The method according to claim 39, further including the steps of: 
comparing, replacing, singly invalidating and multiply invalidating one 

or more entries of said translation lookaside buffer. 
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41. The method according to claim 39, wherein entries of said 
translation lookaside buffer are replaced on a least recently used replacement 

basis. 

5 

42. The method according lo claim 39, further comprising the step of: 
upon an occurrence of a miss in said translation lookaside buffer, hashing 

a virtual mcnory address using a hash function to produce an index into said virtual 
memory lablc of said host processing meansi 

10 

43. The co-processor according to any one of claims 37, 38 and 42, 
wherein said virtuil memory table is a page table. 

44. A memory management apparatus for a co-processor coupled to a host 
15 processor will virtual memory, said virtual memory comprising a plurality of 

virtual -memory pages contained in a host memory coupled to said host processor, 
said host memory comprising a plurality of physical pages, said memory management 
apparatus including: 

buffering means for caching a predetermined nunber of virtual -to-physical 
2D memory address mappings, each memory-address mapping including a virtual-memory 
address and a corresponding physical address; 

means for comparing a Yirtual-meiiory address requested by said co-processor 
with said memory-address mappings currently cached by said buffering means; 

means for, if said comparing means determines said virtual memory address 
25 requested by said co-processor matches a virtual-memory address of one of said 
memory-address mappings in said buffering means, providing a physical address to 
said co-processor from said matching memory-address napping; and 

means for, if said comparing means determines said virtual-memory address 
requested by said co-processor does not match the virtual memory address of any 
30 memory-address mapping currently cached in said buffering means, updating said 
buffering means wilh a memory-address mapping retrieved fron a page (able, said 
retrieved memory-address mapping containing a physical address corresponding to 
said virtual memory address requested by said co-processor, said page table stored 
In a predetermined number of said physical pages. 

35 



45. The apparatus according to claim 44, wherein said predetermined 
number of said physical pages include at least two pages stored in two or more 
non-con tigdoiisiy addressable regions of said host toemory. 
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46. The apparatus according to claim 44, wherein: 

said virtual memory address of said memory-address mapping is a virtual 
page number ; 

said physical address of said memory-address mapping is a physical page 
5 number in said host memory: and 

said requested virtual memory .uidres., comprises an offset within a 
respective virtual memory page, said offsel being a predetermined number of lower 
order hi is dependent on the size of said respective virtual memory page and also 
being the offset within a corresponding physical page. 

10 

47. The apparatus according to claim 44, wherein said pages are of a 
predetermined size, and said virtual memory page numbers and said physical memory 
page numbers are dimensioned to consist of a predetermined number of bi ts dependent 
upon said predetermined page size and a total size of said addressable memory. 

15 

48. The apparatus according to claim 47, wherein said virtual and 
physical page numbers include a fixed size portion of address bits dependent upon 
a minimum possible page size, and a variable size portion of the remaining address 
bits corresponding to a currently specified page size. 

20 

49. The apparatus according to claim 44, wherein said page table 
includes a predetermined number of memory-address mappings, each memory-address 
mapping including a virtual-memory page address and a physical page address in said 
host memory, said predetermined number of memory-address mappings corresponding 

25 to the number of said physical pages in said host memory. 

50. The apparatus according to claim 44, wherein said updating means 
fur (her Includes: 

means for hashing a virtual-memory address of a virtual-memory page of a 
30 first number of bits Into a page table index of a second number of bits, said second 
number being less than said first lumber of bits. 

51. The apparatus according to claim 50, wherein said updating means 
further includes: 

35 means fur storing physical addresses of each of said predetermined number 

of physical pages containing said page table. 



52. The apparatus according to claim 51, wherein said page table index 
comprises a first predefined number of bits for selecting one of said physical 
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addresses of said page-table physical pages, and a second predefined number of 
remaining bits for selecting said retrieved memory-address mapping within said 
selected page-table physical page. 

5 53. The apparatus according to claim 52. further including a leas t- 

recenliy-used buffer ior tracking use of said nemnry-address mappings in said 
buffering means, and wherein said updating means further includes: 

moans for comparing the virtual-memory page address of said retrieved 
nemory-address mapping with a virtual-memory page address of said requested 
10 virtual-memory address; 

means far, if said comparing means determines a match, storing said 
retrieved memory-address mapping in said buffering means dependent upon said 
least-recently-used buffer and for updating said least-recently-used buffer; and 
means for. if said comparing neans does not determine a match, incrementally 
15 addressing the next memory-addressing mapping in said page table and for continuing 
operation with said means for comparing the virtual -memory page addresses. 

54. The apparatus according to claim 44, wherein said page table 
contains at least one invalid entry. 

20 

55. A method of managing virtual memory for a co-processor coupled to 
a host processor with said virtual memory, said virtual memory comprising a 
plurality of virtual -memory pages contained in a host memory coupled to said host 
processor, said host memory comprising a plurality of physical pages, said method 

25 including the steps of: 

caching a predetermined number of virtual- to-physical memory address 
nappi n gs i n bu f f e r i ng means, each memory-address oapp ing i nc 1 ud i ng a vi r t ua 1 -memory 
address and a corresponding physical address: 

comparing a virtual-memory address requested by said co-processor with said 

30 memory-address napplngs currently cached by said buffering neans; 

if said comparison determines said virtual memory address requested by said 
co-processor matches a virtual-memory address of one of said memory-address 
mappings in said buffering means, providing a physical address to said co-processor 
from said matching memory-address napping; and 

35 if said comparison determines said vi rtual -memory address requested by said 

co-processor does not match the virtual memory address of any memory-address mapping 
currently cached in said buffering means, updating said buffering means with a 
memory-address mapping retrieved from a page table, said retrieved memory-address 
mapping containing a physical address corresponding to said virtual memory address 
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requested by said co-processor, said page table stored in a predetermined number 

of said physical pages. 

56. The method according to claim 55, wherein said predetermined number 
5 ol said physical pages include at least Ivo pages stored in two or more non- 

contiguously addressable regions cf said host memory. 

57. The method according to claim 55, wherein: 

said virtual menory address of said memory-address mapping is a virtual 
10 page number; 

said physical address of said memory-address mapping is a physical page 
nunber in said host memory; and 

said requested virtual memory address comprises an offset within a 
respective virtual memory page, said offset being a predetermined number of lower 
15 order bits dependent on the size of said respective virtual memory page and also 
being the offset within a corresponding physical page. 

58. The method according to claim 55, wherein said pages are of a 
predetermined size, and said virtual memory page numbers and said physical memory 

20 page numbers are dimensioned to consist of a predetermined number of bits dependent 
upon said predeternined page size and a total size of said addressable memory. 

59. The method according to claim 58, wherein said virtual and physical 
page numbers include a fixed size portion of address bits dependent upon a minimum 

25 possible page size, and a variable size portion of the remaining address bits 
corresponding to a currently specified page size. 

60. The method according to claim 55, wherein said page table includes 
a predetermined number of nemory-address mappings, each memory-address mapping 

30 including a virtual-memory page address and a physical page address in said host 
lenory. said predetermined number of memory-address mappings corresponding to the 
number of said physical pages in said host memory. 

61. The method according to claim 55, wherein said updating step further 
35 includes the step of: 

hashing a virtual -memory address of a virtual-memory page of a first number 
of bits into a page table index of a second number of bits, said second number being 



less than said first number of bits. 
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62. The method according to claim 61, wherein said updat ing s tep further 
includes the step of: 

storing physical addresses of each of said predetermined number of physical 
pages containing said page table. 

63. The me (hod according to claim 52, wherein said page table index 
comprises a first predefined number of bits for selecting one of said physical 
addresses of said page-table physical pages, and a second predefined number of the 
remaining bits for selecting said retrieved memory-address mapping within said 
selected page-table physical page. 

64. The method according lu claim 53, further including the step of 
tracking said menory-address mappings in said buffering means using a least- 
recent ly-used buffer, and wherein said updating step further includes the step of: 

comparing the virtual-memory page address of said retrieved menory-address 
Happing with a virtual-memory page address of said requested virtual-memory 
address; 

if said virtual-memory page address comparison determines a natch, storing 
said retrieved menory-address napping in said buffering means dependent upon said 
least-recent ly-used buffer and for updating said least-recently-used buffer; and. 

if said comparing means does not determine a match, incrementally 
addressing the nest memory-addressing mapping in said page table and continuing 
operation with said step of comparing the. virtual -memory page addresses. 

65. The method according to clain 55, wherein said page table contains 
at least one invalid entry. 

66. A system for nanaging virtual memory, said system including: 
a host processing deans; 

a host tenor y coupled to said host processing means to implement said 
virtual memory: 

a co-processor adapted for using virtual memory: 

virtual-physical memory mapping means, coupled to said co-processor for 
interrogating a virtual memory table and far mapping one or more virtual memory 
addresses requested by said co-processor into corresponding physical addresses In 
said host memory. 

67. The system according to clain 66, wherein said virtual memory table 
is stored in two or more hon-contlguously addressable regions of said host memory. 
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68, The. system according to claim 66, wherein said virtual-physica! 
neraory napping means further includes: 

a iiul I iole-en try translation lookaside buffer for caching virtual-to- 
5 physical memory address trans la* ions, each memory address translation including 
a vir iua [-memory page address and a corresponding physical page address. 

fid. The system according to claim 68. said virtual-physical memory 
napping means further includes: 
10 Beans for comparing, replacing, singly invalidating and multiply 

invalidating one or more entries of said translation lookaside buffer. 

70. The system according to claim 68, wherein said translation 
lookaside buffer is adapted to replace entries therein on a least recently used 

15 replacement basis. 

71. The system according to claim 68. wherein said virtual-physical 
reeaory mapping neans includes: 

means for. upon an occurrenceof amiss in said iranslation lookaside buffer, 
20 hashing a virtual memory address using a bash function to produce an index into 
said virtual menory table of said host processing means, 

72. The system according to claim 68. wherein: 

said virtual menory comprises a plurality of vir tual-nemory pages contained 
25 in said host menory: and 

said host menory comprises a plurality of physical pages. 

73. The system according to claim 72, Therein said virtual memory table 
is a page table. 

30 

74. The system according to claim 73, wherein said virtual-physical 
memory mapping neans further comprises: 

means for comparing a virtual -menory address requested by said co-processor 
with said memory address translations stored in said translation lookaside buffer; 
35 means for, if said virtual nenory address requested by said co-processor 

matches a virtual -memory address of one of said memory address translations in said 
translation lookaside buffer, providing a physical address to said co-processor 
from said matching memory address translation; and 

means for, if said vir tual -memory address requested by said co-processor 
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does not match the virtual memory address of any memory address translation 
currently stored in .said translation lookaside buffer, updating a least- 
recently-used entry of said translation lookaside buffer with a memory address 
trans lai ion retrieved from said page table, said retrieved memory address 
5 translation containing a physical address corresponding to said virtual memory 
address requested by said co-processor, said page table stored in a predetermined 
number of said physical pages. 



75. The system according lo claim 74, wherein said predetermined number 
10 of said physical pages include at least two pages stored in two or more non- 

contiguously addressable regions of said host memory. 

76. The system according to claim 74. wherein: 

said virtual memory address of said memory address translation is a virtual 
15 rage number; 

said physical address of said memory address translation is a physical page 
number in said host memory; and 

said requested virtual memory address comprises an offset within a 
respective virtual memory page, said offset being a predetermined number of lower 
20 order bits dependent on the siie of said respective virtual memory page and also 
being the offset within a corresponding physical page. 

77. The system according to claim 74. therein said pages are of a 
predetermined size, and said virtual memory page numbers and said physical memory 

25 rage numbers are dimensioned to consist of a predetermined number oi bits dependent 
upon said predetermined page si2e and a total size of said addressable memory. 

78. The system according to c lain 77. therein said virtual and physical 
page numbers include a fixed size portion or address bits dependent upon a minimum 

30 possible page size, and a variable size portion of the remaining address bits 
corresponding to a currently specified page size. 

79. The system according to claim 74, wherein said page table includes 
a predetermined number of memory address translations, each memory address 

35 translation including a virtual-memory page address and a physical page address 
in said host memory, said predetermined number of memory address translations 
corresponding to the number of said physical pages in said host memory. 



The system according to claim 74, wherein said Updating means 
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further includes: 

means for hashing a vi rtua I -memory address of a virtual-memory page of a 
first number of bits into a page table index of a second number of bits, said second 
number being less than said first number of bits. 

5 

81. The system according to claim 80, vhemri said updating means 
further includes: 

means for storing physical addresses of each of said predetermined number 
of physical pages containing said page table. 

10 

82. The system according to claim 81, wherein said page table index 
comprises a first predefined number of bits for selecting one of said physical 
addresses of said page-table physical pages, and a second predefined number of 
remaining bi Is for selecting said retrieved nemory address translation wi thin said 

15 selected page-table physical page. 

83. The system according to claim 82, further including a least- 
recently-used buffer for (racking use of said memory address translations In said 
translation lookaside buffer, and wherein said updating means further includes: 

20 means for comparing the virtual-memory page address of said retrieved 

memory address translation vith a virtual-memory page address of said requested 

virtual-memory address ; 

means for, if said virtual-memory page addresses match, storing said 

retrieved memory address translation in said translation lookaside buffer dependent 
25 upon said least-recently-used buffer and for updating said least-recently-used 

buffer: and 

means for, if said virtual-memory page addresses do not match, 
incrementally addressing the next memory-addressing mapping in said page table and 
for continuing operation vith said means for comparing the virtual -memory page 
30 addresses. 

84. The system according to ctain 74, wherein said page table contains 
at least one invalid entry, 

35 85. A recording medium storing computer-executable program codes in which a 

method according to any one of claims 1 - 7, 15 - 23 and 37 - 42 is implemented. 

fcleM of the Invention 
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The present invention relates to computer architectures and particularly 
those archi tectures dedicated to the creation of graphical images for printing or 
displaying. The present invention relates to the field of computer architecture 
5 and memory management techniques in en-processor systems. 

Background of the Invention 

The creation and printing out of complex images normally proceeds via a 
10 description of the image in a page description language (PDL) such as Postscript 

(trade nark). The page description language normal h describes how to construct 

the output image from a number of primitives and compositing operators. Often. 

a major advantage of utilizing a page description language is device independence 

in that the same description can be utilized on multiple output devices laking 
15 advantage of those features vi thin the device which can provide advantageous ei fee ts. 

Other advantages include the ability to easily edit or amend portions of the page. 

Further, optimizations on the PDL can be implemented thereby speeding up the 

rendering process. 

The process of taking a page description and creating a corresponding page 

20 for print ing from the description is knovnas rasterization and is often quite taxing 
of computer resources. Where the PDL interpretation or rasterization process is 
carried nut by a software interpreter running on a host CPU system, the software 
interpreter is likely to take up a substantial amount of the CPU resource in the 
creation of each page such that the main host CPU has little time to do anything 

25 else. Additionally, the software interpreter is likely to take an excessively long 
time where each pixel in an image Bust be decompressed and/or colour converted. 

It is known lo provide for hardware acceleration of various aspects of the 
graphical image creation process. For example, hardware JPFG compression/ 
decompression chips are generally available in the market place for performing 

3D hardware acceleration of the compression/decompression process. 

The type of output device such as a printer or display upon which an image 
is lobe created can be highly variable in its requirements for displaying an image. 
Some printer devices, once undertaking the printing of a page, require pixel 
information in respect of the whole page to be provided tn advance of printing or 

35 within predetermined tine periods and are unable to be stopped at any time during 
the printing of a page. Other output devices operate in a "banded" manner in that 
one band of the image is printed at a time and an arbitrary time period can occur 
between bands. Other output devices are able to receive updated pixel information 
in an arbitrary manner. 
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Amain object ive of the present invention is toprint out an arbitrary image 
as fas 1 as possible usingvarious printers or other output devices. In this respect, 
it is highly desirable to keep the output device "busy" printing (or displaying) 
and not wailing for the required pixel data. Every time the output device must 
5 wait fur pixel data, the printing or displaying process wi 11 obviously be delayed 
thereby leading to user frustration. Further, it is also advantageous to ensure 
thai alt resources applied I o the problem of rasterization are kept fully utilised 
in productive activi ty in order to provide a max iisi sed output printing (or display) 
speed. 

10 Further, depending upon the type of output arrangement uti 1 ueti. the result 

can be Ycry susceptible to perceived faults. For example, in some video 
applications where an image is being continually refreshed, if the new data is not 
supplied in sufficient lime, the old data can be re-displayed and no fault is 
perceived by the viewer. However, in other Yideo applications, the old image is 

is discarded at the refresh rate and thus if the new image is not available an obvious 
blank or void is present. - With printing, because the resul t ing image is permanent, 
any minor defect in the image data often produces a glaring fault. Further, as 
mentioned above, depending on the nature of the printer, the data must be supplied 
not only in perfect fom but in excess of a predetermined speed. However, with 

20 printing a delay before the final image is output can be tolerated 

With the above in mind, it is the object of the present invention to render 
image data prior to outpulting the resulting image by firstly utilizing a graphics 
co-processor in conjunction with a host processor, and secondly providing two or 
more parallel arranged data calculation streams within the graphics co-processor. 

25 [f only one of these data stream is operated at a time, it has been found that 
substantial speed advantages result when operating on streams of like image data. 
This is because the co-processor concentrates on the like sequential calculations 
and does not lose time in being reset or reconfigured to perform different 
calculations, . r r - 

30 Fur I her, whilst one calculation streaa is operating, the other (s) ean be 

reconfigured if necessary in preparation lo perform a different calculation. 

Modern computer systems typically require some method of memory nanagetnent 
to provide for dynamic memory allocation. In the case of a system with one or more 
35 co-processors, some method is necessary to synchronize between the dynamic 
allocation of memory and the use of that memory by a co-processor. 

In a typical hardware configuration oi a CPU with a specialised co-processor, 
both share a bank of memory. In such a systen, the CPU is the only entity in the 
system capable of allocating memory dynamically. Once allocated by the CPU for , 
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use by the co-processor, this memory can be used freely by Ihe co-processor until 
it is* no longer required, at which point it is able to be freed by the CPU, This 
implies that some form of synchronization is necessaTy between the CPU and the 
co-processor :n order to ensure that the memory is released only after the co- 
processor is finished using it. 

Several possible solutions to this problem have undesirable performance 
implications. Use of statically allocated memory would avoid the need for 
synchronization, but vould prevent the system from adjusting its memory resource 
usage dynamically. Alternatively, having the CPU block and wait until the co- 
processor has finished performing each operation would substantially reduce 
parallelism and hence reduce the overall system pe if omance. Similarly, the use 
of interrupts to indicate completion of operations by the co-processor would also 
impose significant processing overhead if co-processor throughput is very high. 
So these prior art solutions are not attractive. 

In addition to the need for high performance, such a system also has to 
deal with dynamic memory shortages gracefully. Most computer systems allow a wide 
range of memory size configurations. It is important that a system with large 
amounts of memory available to it make full use of the available resources to 
maximise performance. However, systems with minimal configurations must still 
perform adequately to be usable and at the very least degrade gracefully in the 
lace of a memory shortage. 

To overcome these problems, a synchronization mechanism is desired which 
will maximise system performance while also allowing co-processor memory usage to 
adjust dynamically to both the capacity of the system, and the complexity of the 
operation being performed. Hie present invention is based upon the realisation 
that after co-processnr instructions have been completed, they can be placed in 
a "clean-up" queue and fron tine to time tie memory resources allocated to these 
executed instructions can be reallocated by the CPU 

Virtual memory computer architectures have become increasingly popular In 
modern times. This popularity has increased to extent that even most 
microprocessor/personal computer systems now utilise virtual memory. Virtual 
memory has known significant advantages in simplifying the software production 
process. 

In order to speed up the operation of the computer system, it has been 
sugges ted a number of co-processors could be utilised to speed up various operations 
such as the production of graphical images for display or printing out. It would 
be advantageous if such systems could operate in conjunction utilising virtual 
memory. : t , . 
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Summary of the Invention 

In accordance with one aspect of the present invention, there is disclosed 
5 a method of rendering image data prior to output of the resulting image, said raelhod 
prising the steps of: 

(1) utilising a graphics co-processor in conjunction with a host 

processor, 

(2) within the graphics co-processor providing a plurality of data 
10 calculation streams arranged in parallel fashion, and 

(3) operating only one of said data calculation streans at any one t ime 
to perform like image calculations on a stream of like image data. 

Preferably at least one of the data calculation streams is provided with 
a rcconf igurable calculation anil and at the conclusion of step (3) the 

15 reconf igurable calculation unit is reconfigured. In addition the image data is 
preferably normalized prior to the step (3) being commenced. 

In accordance with another aspect of the present invention there is 
disclosed apparatus for rendering image data prior to output ting the resulting image, 
said apparatus comprising a graphics co-processor adapted to operation in 

20 conjunct ion wi th a host processor, said graphics co-processor having a plural ity 
of data calculation streams arranged in parallel fashion but only one of which is 
operated at any one lime to perforn like image calculations on a slreara of like 
image data. 

Preferably at least one of the data calculation streams has a reconf igurable 
25 calculation unit. Preferably a data normalization means is also provided upstream 
of the parallel data calculation streams. 

In the following detailed description, the reader's attention is directed, m particular, to 
Figs. 1, % 8 and 9 and their associated description without intending to detract &om the 
disclosure of the remainder of the description. 

30 

In accordance with one aspect of the present invention, there is disclosed 
a method of controlling the interaction between a host CPU and at least one 
co-processor in a computer system to permit substantially simultaneous decoupled 
execution of CPU instructions and co-processor instructions, and dynamic allocation 
35 of commonly used memory space during the course of the execution of said instructions, 
said method comprising the steps of: 

(a) said host CPU al locating memory resources to be utilized by a set 
of instruct ions to he co-processor executed; 



(b) generating a queue of pending co-processor instructions to be 
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executed and a clean up queue of co-processoT instructions for which execution has 
been completed ; 

(c) from time to time, under control of said host CPU, releasing for 
rc-allocat ion memory resources previously utilized by the instructions contained 
5 in said clean up queue of executed instructions. 

Preferably the release of the allocated memory is carried out after the 
execution of a specific instruction. This instruction can be the last instruction 
in a pending instructiou queue or it can be a predetermined instruction which 
utilises very substantia] memory resources, Al ternatively, the host CPU can detect 
10 that currenily free memory resources are low (or exhausted) and thereby initiate 
the release of allocated memory which is no longer ia use by the coprocessor. 

In accordance with a second aspect of the present invention there is 
disclosed dynamic nemory management means In a computer system having a memory of 
predetermined size, a host CPU and at least one co-processor, said memory management 
15 means comprising: 

(a) an instruction generator means connected with said-host CPU and 
generating a seanence of instructions intended for co-processor execution, 

(b) a memory manager means connected to said menory and said instruction 
generator means to dynamically allocate space in said memory for co-processor use 

20 in executing said sequence of co-processor instructions, 

(c) a queue manager means connected to said instruction generator means, 
said memory manager means and said co-processor, said queue manager means being 
arranged to store said sequence of instructions in a queue of pending instructions 
to be co-processor executed and a clean up queue of instructions which have been 

25 co-processor executed, 

wherein Trom time to time said queue manager means removes executed 
instructions from said clean up queue to thereby release for reallocation memory 
space previously allocated to said removed executed instructions. 

Yariaus ms of triggering the operation of the queue manger means arc 
30 preferably provided including the memory manger means being unable to satisfy a 
request for memory space or interrupting the CPU processing until a predetermined 
fraction (eg 1/3 or X) of the queue of pending co-processor instructions have been 
executed by the co-processor. 

In the following detailed description, the readers attention is directed, 
35 in particular, to Figs, i to 7 and their associated description without intending 
to detract from the disclosure of the remainder of the description. 

In accordance with a first aspect of the invention, there is provided a 



co-processor adapted fur using virtual memory with a host processing device, a 
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memory coupled to the host processing device to implement the virtual menory, the 
co-processor including: 

virtual-physical menory mapping device for interrogating a vir tual memory 
tabic and for mapping one or more virtual memory addresses requested by the 
5 co-processor into corresponding physical addresses in the memory of the host 
processing device. 

Preferably, the virtual memory table is stored in two or more non- 
contiguous ly addressable regions of the memory, and may be a page table. 

Preferably, (he virtual-physical memory mapping device further includes 
10 a uul tiple-cntry translation lookaside buffer for caching viilual-to-physical 
address mappings. The translation lookaside buffer is adapted to replace entries 
therein on a least recently used replacement basis. The virtual-physical memory 
mapping device may also include devices for comparing, replacing, singly 
invalidating and multiply invalidating one or more entries of the translation 
15 lookaside buifer. Still further, Ihe virtual-physical nemory mapping device may 
include a device for, upon an occurrence of a miss in the translation lookaside 
buffer, hashing a virtual memory address using a hash function to produce an index 
into the virtual memory table of the host processing device. 

In accordance with a second aspect of the invention, there is provided a 
20 method of operating a co-processor to use virtual memory with a hosL processing 
device, a memory coupled io the host processing device to implement the virtual 
memory, the method including the steps of: 

interrogating a virtual memory table; and 

napping one or more virtual menory addresses requested by the co-processor 
25 into corresponding physical addresses in the memory of the host processing device. 

In accordance vith a third aspect of the invention, Ihere is provided a 
menory management apparatus for a co-processor coupled to a host processor with 
virtual memory, the virtual memory comprising a plural Uy of virtual-memory pages 
contained in a hc&i memory coupled. to the host processor, the host memory comprising 
30 a plurality of physical pages, the memory management apparatus including: 

buffering device fojr caching a predetermined number of virtoal-to-physical 
memory address mappings, each memory-address mapping including a virtual-memory 
address and a corresponding physical address; 

device for conparing a virtual-memory address requested by Ihe co-processor 
35 vith the nemory-address mappings currently cached by the buffering device; 

device for, if the comparing device determines the virtual memory address 
requested by the co-processor natches a virtual -memory address of one of the 
memo ry-addr ess mappings in the buffering device, providing a physical address to 
the co-processor from the matching memory-address mapping; and 
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device for, if the comparing device determines the virtual-memory address 
requested by the co-processor dnes not match the virtual memory address of any 
memory-address mapping currently cached in the buffering device, updating the 
buffering device with a memory-address mapping retrieved from a page table, the 

5 retrieved memory-address mapping containing a physical, address corresponding to 
the virtual memory address requested by the cn-processor, the page table stored 
in a predetermined number of the physical pages. 

In accordance with a fourth aspect of the invention, there is provided a 
method of managing virtual memory for a co-processor coupled to a host processor 

10 with the virtual memory, the virtual memory comprising a plurality of virtual- 
memory pages contained in a host memory coupled to the host processor, the host 
memory comprising a plurality of physical pages, the method including the steps 
of: 

caching a predetermined number of virtual-to-physical memory address 
15 mappings in buffering device, each memory-address mapping including a virtual- 
memory address and a corresponding physical address; 

comparing a virtual-memory address requested by the co processor with the 
memory address mappings currently cached by the buffering device; 
20 if the comparison determines the virtual ncmory address requested by the 

co-processor Batches a virtual -memory address of one of the memory-address mappings 
in the buffering device, providing a physical address to the co-proccssor from the 
matching memory-address mapping: and 

if the comparison determines the virtual-nemory address requested by the 
25 co-processor does not match the virtual memory address of any memory-address mapping 
currently cached in the buffering device, updating the buffering device with a 
aenory-addrcss napping retrieved Trom a page table, the retrieved memory-address 
napping containing a physical address corresponding to the virtual memory address 
requested by the co-processor, the page table stored in a predetermined number of 
30 the physical pages. 

In accordance with a fifth aspect of the invention, there is provided a 
system Tor managing virtual memory, the system including: 
a host processing device; 

a hos t memory coupled to the host processing device to implement the virtual 

35 memory: 

a co-processor adapted for using virtual memory; 



virtual-physical memory mapping device, coupled to the co-processor, for 
interrogating a virtual memory table and for mapping one or more virtual memory 
addresses requested by the co-processor into corresponding physical addresses in 
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the host iieraory. 

In the fol lowing detailed descript ion, the reader* s attention is directed, 
in pailicular, to Figs. 150 lo 154 and their associated description without 
intending to detract from the disclosure of the renainder of the description. 
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In the preferred embodiment, a substantial advantage is gained in hardware 
rasterization by means of utilization of two independent instruction streams by a hardware 
accelerator. Hence, while the first instruction stream can be preparing a current page for 
printing, a subsequent instruction stream can be preparing the next page for printing. A 

5 Y\l?, u utili;:;*:' in " hardware rcso f . :cs is available especially - u erc the Iwdw^rc 
accelerator is able tu wurk at a speed substantially faster than the speed of the output 
device. 

The preferred embodiment describes an arrangement utilising two instruction 
streams. However, arrangements having further instruction streams can be provided 

10 where the hardware trade-offe dictate that substantial advantages can be obtained through 
the utilization of further streams. 

The utilization of two streams allows the hardware resources of the raster image 
co-processor to be kept fully engaged in preparing subsequent pages or bands, strips, etc., 
depending on the output printing device while a present page, band, etc is being 

15 forwarded to a print device. 

3, 1 General Arrangement of Plural Stream Architecture 

In Fig. 1 there is schematically illustrated a computer hardware arrangement 201 
which constitutes the preferred embodiment. The arrangement 201 includes a standard 
host computer system which takes the form of a host CPU 202 interconnected to its own 

20 memory store (RAM) 203 via a bridge 204. Hie host computer system provides all the 
normal facilities of a computer system including operating systems programs, applications, 
display of information, etc. The host computer system is connected to a standard PCI 
bus 206 via a PCI bus interface 207. The PCI standard is a well known industry standard 
{ ) and most computer systems sold today, particularly those running Microsoft Windows 

25 (trade mark) operating systems, normally come equipped with a PCI bus 206. The PCI 
bus 2U6 allows the arrangement 201 to be expanded by means of the addition of one or 
more PCI cards, eg. 209, each of which contain a further PCI bus interface 210 and other 
devices 211 and local memory 212 for utilization in the arrangement 201. 

In the preferred embodiment, there is provided a raster image accelerator card 

30 220 to assist in the speeding up of graphical operations expressed in a page description 
language. The raster image accelerator card 220 (also having a PCI bus interface 221) is 
designed to operate in a loosely coupled, shared memory manner with the host CPU 202 
in the same manner as other PCI cards 209. It is possible to add further image 
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accelerator cards 220 to the host computer system as required. The raster image 
accelerator card is designed to accelerate those operations that form the bulk of the 
execution complexity in raster image processing operations. These can include: 



W 




(b) 


Generalized Color Space Conversion 


(c) 


JPEG compression and decompression 


(<D 


Huffman, run length and predictive coding and decoding 


(e) 


Hierarchial image (Trade Mark) decompression 


(f) 


Generalized affine image transformations 


GO 


Small kernel convolutions 


00 


Matrix multiplication 


(i) 


Halftoning 


0) 


Bulk arithmetic and memory copy operations 



The raster image accelerator card 220 further includes its own local memory 223 
connected to a raster image co-processor 224 which operates the raster image accelerator 
card 220 generally under instruction from the host CPU 202. The co-processor 224 is 
preferably constructed as an Application Specific Integrated Circuit (ASIC) chip. The 
raster image co-processor 224 includes the ability to control at least one printer device 
226 as required via a peripheral interface 225. The image accelerator card 220 may also 
control any input/output device, including scanners. Additionally, there is provided on 
the accelerator card 220 a generic external interface 227 connected with the raster image 
co-processor 224 for its monitoring and testing. 

In operation, the host CPU 202 sends, via PCI bus 206, a series of instructions 
and data for the creation of images by the raster image co-processor 224. The data can 
be stored in the local memory 223 in addition to a cache 230 in the raster image co- 
processor 224 or in registers 229 also located in the co-processor 224. 

Turning now to Fig. 2, there is illustrated, in more detail, the raster image co- 
processor 224. The co-processor 224 is responsible for the acceleration of the 
aforementioned operations and consists of a number of components generally under the 
control of an instruction controller 235. Turning first to the co-processor's 
communication with the outside world, there is provided a local memory controller 236 
for communications with the local memory 223 of Fig. I. A peripheral interface 
controller 237 is also provided for the communication with printer devices utilising 
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standard formats such as the Centronics interface standard format or other video interface 
formats. The peripheral interface controller 237 is interconnected with the local memory 
controller 236. Both the local memory controller 236 and the external interface 
controller 238 are connected with an input interface switch 252 which is in turn connected 

5 to the : "-.ruction co* \~ r.er 235, The input interface s--. \\zh 252 is also a ineae { . 10 :i 
pixel organizer 246 and a data cache controller 240. The input interface switch 252 is 
provided for switching data from the external interface controller 238 and local memory 
controller 236 to the instruction controller 235, the data cache controller 240 and the pixel 
organizer 246 as required. 

10 For communications with the PCI bus 206 of Fig. 1 the external interface 

controller 238 is provided in the raster image co-processor 224 and is connected to the 
instruction controller 235. There is also provided a miscellaneous module 239 which is 
also connected to the instruction controller 235 and which deals with interactions with the 
co-processor 224 for purposes of test diagnostics and the provision of clocking and global 

15 signals. 

The data cache 230 operates under the control of the data cache controller 240 
with which it is interconnected. The data cache 230 is utilized in various ways, 
primarily to store recently used values that are likely to be subsequently utilized by the co- 
processor 224. The aforementioned acceleration operations are carried out on plural 

20 streams of data primarily by a JPEG coder/decoder 241 and a main data path unit 242. 
The units 241, 242 are connected in parallel arrangement to all of the pixel organizer 246 
and two operand organizers 247, 248. The processed streams from units 241, 242 are 
forwarded to a results organizer 249 for processing and reformatting where required. 
Often, it is desirable to store intermediate results close at band. To this end, in addition 

25 to the data cache 230, a multi-used value buffer 250 is provided, interconnected between 
the pixel organizer 246 and the result organizer 249, for the storage of intermediate data. 
The result organizer 249 outputs to the external interface controller 238, the local memory 
controller 236 and the peripheral interface controller 237 as required. 

As indicated by broken lines in Fig. 2, a further (third) data path unit 243 can, if 

30 required be connected "in parallel 11 with the two other data paths in the form of JPEG 
coder/decoder 241 and the main data path unit 242. The extension to 4 or more data 
paths is achieved in the same way. Although the paths are "parallel" connected, they do 
not operate in parallel. Instead only one path at a time operates. 
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The overall ASIC design of Fig. 2 has been developed in the following manner. 
Firstly, in printing pages it is necessary that there not be even small or transient artefacts. 
This is because whilst in video signal creation for example, such small errors If present 
may not be apparent to the human eye (and hence be uncbservable), in printing any small 
5 artefact appears permanently on the printed page and can sometimes be glaringly obvious. 
Further, any delay in the signal reaching the printer can be equally disastrous resulting in 
white, imprinted areas on a page as the page continues to move through the printer. Tt is 
therefore necessary to provide results of very high quality, very quickly and this is best 
achieved by a hardware rather than a software solution. 

10 Secondly, if one lists all the various operational steps (algorithms) required to be 

carried out for the printing process and provides an equivalent item of hardware for each 
step, the total amount of hardware becomes enormous and prohibitively expensive. Also 
the speed at which the hardware can operate is substantially limited by the rate at which 
the data necessary for, and produced by, the calculations can be fetched and despatched 

15 respectively. That is, there is a speed limitation produced by the limited bandwidth of 
the interfaces. 

However, overall ASIC design is based upon a surprising realization that if the 
enormous amount of hardware is represented schematically then various parts of the total 
hardware required can be identified as being (a) duplicated and (b) not operating all the 

20 time. This is particularly the case in respect of the overhead involved in presenting the 
data prior to its calculation. 

Therefore various steps were taken to reach the desired state of reducing the 
amount of hardware whilst keeping all parts of the hardware as active as possible. The 
first step was the realization that in image manipulation often repetitive calculations of the 

25 same basic type were required to be carried out, Thus if the data were streamed in some 
way, a calculating unit could be configured to carry out a specific type of calculation, a 
long stream of data processed and thai the calculating unit could be reconfigured for the 
next type of calculation step required. If the data streams were reasonably long, then the 
time required for reconfiguration would be negligible compared to the total calculation 

30 time and thus throughput would be enhanced. 

In addition, the provision of plural data processing paths means that in the event 
that one path is being reconfigured whilst the other path is being used, then there is 
substantially no loss of calculating time due to the necessary reconfiguration. This 
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applies where the main data path unit 242 carries out a more general calculation and the 
other data path(s) cany out more speciali2ed calculation such as JPEC coding and 
decoding as in unit 241 or, if additional unit 243 is provided, it can provide entropy and/or 
Huffman coding/decoding. 

5 Further, whih I!*e calcu! .on .vers proceeding, the fetching and presentinc nf 

data to the calculating unit can be proceeding. This process can be further speeded up, 
and hardware resources better utilized, if the various types of data are standardized or 
normalized in some way. Thus the total overhead involved in fetching and despatching 
data can be reduced. 

10 Importantly, as noted previously, the co-processor 224 operates under the control 

of host CPU 202 (Fig. 1). In this respect, the instruction controller 235 is responsible for 

the overall control of the co-processor 224. The instruction controller 235 operates the 

co-processor 224 by means of utilising a control bus 231, hereinafter known as the CBus. 

The CBus 231 is connected to each of the modules 236-250 inclusive to set registers (231 
15 of Fig. 1) within each module so as to achieve overall operation of the co-processor 224. 

In order not to overly complicate Fig. 2, the interconnection of the control bus 231 to each 

of the modules 236-250 is omitted from Kg. 2. 

Turning now to Fig. 3, there is illustrated a schematic layout 260 of the available 

module registers. The layout 260 includes registers 261 dedicated to the overall control 
20 of the co-processor 224 and its instruction controller 235. The co-processor modules 

236-250 include similar registers 262 

3*2 Host/Co-processor Queuing 

With (he above architecture in mind, it is clear that there is a need to adequately 

provide for cooperation between the host processor 202 and the image co-processor 224. 
25 However, the solution to this problem is general and not restricted to the specific above 

described architecture and therefore will be described hereafter with reference to a more 

general computing hardware environment 

Modern computer systems typically require some method of memory 

management to provide for dynamic memory allocation, in the case of a system with 
30 one or more co-processors, some method is necessary to synchronize between the 

dynamic allocation of memory and the use of that memory by a co-processor. 

Typically a computer hardware configuration has both a CPU and a specialized 

co-processor, each sharing a bank of memory. In such a system, the CPU is the only 
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entity in the system capable of allocating memory dynamically. Once allocated by the 
CPU for use by the co-processor, this memory can be used freely by the co-processor until 
it is no longer required, at which point it is available to be freed by the CPU. This 
implies that some form of synchronization is necessary between the CPU and the co- 

5 processor in order to ensure that the memory is released only after the co-processoi is 
finished using it. There are several possible solutions to this problem but each has 
undesirable performance implications. 

The use of statically allocated memory avoids the need for synchronization, but 
prevents the system from adjusting its memory resource usage dynamically. Similarly, 

10 having the CPU block and wait until the co-processor has finished performing each 
operation is possible, but this substantially reduces parallelism and hence reduces overall 
system performance. The use of interrupts to indicate completion of operations by the 
co-processor is also possible but imposes significant processing overhead if oo-processor 
throughput is very high. 

15 In addition to the need for high performance, such a system also has to deal with 

dynamic memory shortages gracefully. Most computer systems allow a wide range of 
memory size configurations. It is important that those systems with large amounts of 
memory available make full use of their available resources to maximize performance. 
Similarly those systems with minimal memory size configurations should still perform 

20 adequately to be useable and, at the very least, should degrade gracefully in the face of a 
memory shortage. 

To overcome these problems, a synchronization mechanism is necessary which 
will maximize system performance while also allowing co-processor memory usage to 
adjust dynamically to both the capacity of the system and the complexity of the operation 
25 being performed. 

In general, the preferred arrangement for synchronising the (host) CPU and the 
co-processor is illustrated in Fig. 4 where the reference numerals used are those already 
utilized in the previous description of Fig. 1. 

Thus in Fig. 108, the CPU 202 is responsible for all memory management in the 
30 system. It allocates memory 203 both for its own uses, and for use by the co-processor 
224. the co-processor 224 has its own graphics-specific instruction set, and is capable 
of executing instructions 1022 from the memory 203 which is shared with the host 
processor 202. Bach of these instructions can also write results 1024 back to the shared 
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memory 203, and can read operands 1023 from the memory 203 as well. The amount of 
memory 203 required to store operands 1023 and results 1024 of co-processor instructions 
varies according to the complexity and type of the particular operation. 

The CPU 202 is also responsible for generating the instructions 1022 executed 
by the co-processor 224. To maximize the degree of parallelism between the CPU 202 
and the co-processor 224, instructions generated by the CPU 202 are queued as indicated 
at 1022 for execution by the co-processor 224. Each instruction in the queue 1022 can 
reference operands 1023 and results 1024 in the shared memory 203, which has been 
allocated by the host CPU 202 for use by the co-processor 224. 

The method utilizes an interconnected instruction generator 1030, memory 
manager 1031 and queue manager 1032, as shown in Fig. 5. All these modules execute 
in a single process on the host CPU 202. 

Instructions for execution by the co-processor 224 are generated by the 
instruction generator 1030, which uses the services of the memory manager 1031 to 
allocate space for the operands 1023 and results 1024 of the instructions being generated. 
The instruction generator 1030 also uses the services of the queue manager 1032 to queue 
the instructions for execution by the co-processor 224. 

Once each instruction has been executed by the co-processor 224, the CPU 202 
can free the memory which was allocated by the memory manager 1031 for use by the 
operands of that instruction. The result of one instruction can also become an operand 
for a subsequent instruction, after which its memory can also be freed by the CPU. 
Rather than fielding an interrupt, and freeing such memory as soon as the co-processor 
224 has finished with it, the system frees the resources needed by each instruction via a 
cleanup function which runs at some stage after the co-processor 224 has completed the 
instruction. The exact time at which these cleanups occur depends on the interaction 
between the memory manager 1031 and the queue manager 1032, and allows the system 
to adapt dynamically according to the amount of system memory available and the 
amount of memory required by each co-processor instruction. 

Fig. 6 schematically illustrates the implementation of the co-processor 
instruction queue 1022. Instructions are inserted into a pending instruction queue 1040 
by the host CPU 202, and are read by the co-processor 224 for execution. After 
execution by the co-processor 224, the instructions remain on a cleanup queue 1041, so 
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that the CPU 202 can release the resources that the instructions required after the co- 
processor 224 has finished executing them. 

The instruction queue 1022 itself can be implemented as a fixed or dynamically 
sized circular buffer. The instruction queue 1022 decouples the generation of 
5 instructions by the CPU 202 from their execution by the co-processor 224. 

Operand and result memory for each instruction is allocated by the memory 
manager 1031 (Fig. 5) in response to requests from the instruction generator 1030 during 
instruction generation. It is the allocation of this memory for newly generated 
instructions which triggers the interaction between the memory manager 1031 and the 
10 queue manager 1032 described below, and allows the system to adapt automatically to the 
amount of memory available and the complexity of the instructions involved. 

The instruction queue manager 1032 is capable of waiting for the co-processor 
224 to complete the execution of any given instruction which has been generated by the 
instruction generator 1030. However, by providing a sufficiently large instruction queue 
15 1022 and sufficient memory 203 for allocation by the memory manager 1031, it becomes 
possible to avoid having to wait for the co-processor 224 at all, or at least until the very 
end of the entire instruction sequence, which can be several minutes on a very large job. 
However, peak memory usage can easily exceed the memory available, and at this point 
the interaction between the queue manager 1032 and the memory manager 1031 comes 
20 into play. 

The instruction queue manager 1032 can be instructed at any time to "cleanup" 
the completed instructions by releasing the memory that was dynamically allocated for 
them, (f the memory manager 1031 detects that available memory is either running low 
or is exhausted, its first recourse is to instruct the queue manager 1032 to perform such a 

25 cleanup in an attempt to release some memory which is no longer in use by the co- 
processor 224. This can allow the memory manager 1031 to satisfy a request from the 
instruction generator 1030 for memory required by a newly generated instruction, without 
the CPU 202 needing to wait for, or synchronize with, the co-processor 224. 

If such a request made by the memory manager 1031 for the queue manager 1032 

30 to cleanup completed instructions does not release adequate memory to satisfy the 
instruction generator's new request, the memory manager 1031 can request that the queue 
manager 1032 wait for a fraction, say half, of the outstanding instructions on the pending 
instruction queue 1040 to complete. This will cause the CPU 202 processing to block 




( 304 ) 1 -8 5 9 6 9 

- 33 - 

until some of the co-processor 224 instructions have been completed, at which point their 
operands can be freed, which can release sufficient memory to satisfy the request. 
Waiting for only a fraction of the outstanding instructions ensures that the co-processor 
224 is kept busy by maintaining at least some instructions in its pending instruction queue 
1040. In many cases the cleanup from the fraction of the pending instruction queue 1040 
that the CPU 202 waits for, releases sufficient memory for the memory manager 1031 to 
satisfy the request from the instruction generator 1030. 

In the unlikely event that waiting for the co-processor 224 to complete execution 
of, say, half of the pending instructions does not release sufficient memory to satisfy the 
request, then the final recourse of the memory manager 1031 is to wait until all pending 
co-processor instructions have completed. This should release sufficient resources to 
satisfy the request of the instruction generator 1030, except in the case of extremely large 
and complex jobs which exceed the system's present memory capacity altogether. 

By the above described interaction between the memory manager 1031 and the 
queue manager 1032, the system effectively tunes itself to maximize throughput for the 
given amount of memory 203 available to the system. More memory results in less need 
for synchronization and hence greater throughput. Less memory requires the CPU 202 
to wait more often for the co-processor 224 to finish using the scarce memory 203, 
thereby yielding a system which still functions with minimal memory available, but at a 
lower performance. 

The steps taken by the memory manager 1031 when attempting to satisfy a 
request from the instruction generator 1030 are summarized below. Each step is tried in 
sequence, after which the memory manager 1031 checks to see if sufficient memory 203 
has been made available to satisfy the request. If so, it stops because the request can be 
satisfied; otherwize it proceeds to the next step in a more aggressive attempt to satisfy the 
request: 

1. Attempt to satisfy the request with the memory 203 already available. 

2. Cleanup all completed instructions. 

3. Wait for a fraction of the pending instructions. 

4. Wait for all the remaining pending instructions. 

Other options can also be used in the attempt to satisfy the request, such as 
waiting for different fractions (such as one-third or two-thirds) of the pending instructions, 
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or wailing for specific instructions which arc known to be using large amounts of 
memory. 

Turning now to Fig. 7, in addition to the interaction between the memory 
manager 1031 and the queue manager 1032, the queue manager 1032 can also initiate a 
synchronization with the co-processor 224 in the case where space in a fixed-length 
instruction queue buffer 1050 is exhausted. Such a situation is depicted in Fig. 7. In 
Fig. 7 the pending instructions queue 1040 is ten instructions in length. The latest 
instruction to be added to the queue 1040 has the highest occupied number. Thus where 
space is exhausted the latest instruction is located at position 9. The next instruction to 
be input to the co-processor 224 is waiting at posidon zero. 

In such a case of exhausted space, the queue manager 1032 will also wait for. say, 
half the pending instructions to be completed by the co-processor 224. This delay 
normally allows sufficient space in the instruction queue 1040 to be freed for new 
instructions to be inserted by the queue manager 1032. 

The method used by the queue manager 1032 when scheduling new instructions 
is as follows: 

1. Test to see if sufficient space is available in the instruction queue 1040. 
2 If sufficient space is not available, wait for the co-processor to complete 

some predetermined number or fraction of instructions. 
3. Add the new instructions to the queue. 

The method used by the queue manager 1032 when asked to wait for a given 
instruction is as follows: 

1 . Wait until the co-processor 224 indicates that the instruction is complete. 

2. While there are instructions completed which are not yet cleaned up, clean 
up the next completed instruction in the queue. 

The method used by the instruction generator 1030 when issuing new 
instructions is as follows: 

t. Request sufficient memory for the instruction operands 1023 from the 
memory manger 1031. 

2. Generate the instructions to be submitted. 

3. Submit the co-processor instructions to the queue manager 1032 for 
execution. 
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The following is an example of pseudo code of the above decision making 
processes, 

MEMORY MANAGER 
5 ALLOCATE MEMORY 

BEGIN 

IF sufficient memory is NOT available to satisfy request 
THEN 

Clean up all completed instructions. 
10 ENDIF 

IF sufficient memory is still NOT available to satisfy request 
THEN 

CALL WAITJFORJNSTRUCTIONfor half the pending 

instructions. 
15 ENDW 

IF sufficient memory is still NOT available to satisfy request 
THEN 

RETURN with an error, 
ENDIF 

20 RETURN the allocated memory 

END 

QUEUE MANAGER 

SCHEDULE ^INSTRUCTION 
BEGIN 

25 IF sufficient space is NOT available In the instruction queue 

THEN 

WAIT for the co-processor to complete some predetermined 
number of instructions. 
ENDIF 

30 AM the new instructions to the queue. 

END 

WAirjrORJNSmUCTIONQ) 
BEGIN 
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WAIT until the co-processor indicates that instruction i is complete. 
WHILE there are instructions completed which are not yet cleaned 
up 
DO 

IF the next completed instruction has a cleanup Junction 
THEN 

CALL the cleanup function 
ENDIF 

REMOVE the completed instruction from the queue 
DONE 

END 

INSTRUCTION GENERATOR 

GENERATE INSTRUCTIONS 
BEGIN 

CALL ALLOCATE _MEMORY to allocate sufficient memory for 
the instructions operands from the memory manager. 
GENERATE the instructions to be submitted. 
CALL SCHEDULE JNSTRUCTION submit the co-processor 
instructions to the queue manager for execution. 

END 

33 Register Description of Co-processor 

As explained above in relation to Figs. 1 and 3, the co-processor 224 maintains 
various registers 261 for the execution of each instruction stream. 

Referring to each of the modules of Fig. 2, Table 1 sets out the name, type and 
description of each of the registers utilized by the co-processor 224 while Appendix B 
sets out the structure of each field of each register. 



Table 1: Register Description 



NAME 


TYPE | DESCRIPTION 


External Interface Controller Registers 


eic_cfg 


... 


Config2 | Configuration 



( 308 ) 
- 37 - 



4*BH¥ 11-85969 



eic_stat 


Status 


Status 


eic_errjnt 


Interrupt 


Error and Interrupt Status 


eic_errjnt_en 


Config2 


Error and Interrupt Enable 


eicjest 


Config2 


Test modes 


eic_gen_j>ob 


Config2 


Generic bus programmable output bits 


eic_high_addr 


Configl 


Dual address cycle offset 


eic_wtlb_v 


Control2 


Virtual address and operation bits for TLB 
Invalidate/Write 


eic_wtlb_p 


Config2 


Physical address and control bits for TLB 
Write 


eic_mmu_v 


Status 


Most recent MMU virtual address translated, 
and current LRU location. 


eic_rnmu_v 


Status 


Most recent page table physical address 
fetched by MMU. 


eic_ip_addr 


Status 


Physical address for most recent IBus access 
to the PQ Bus, 


eic_rp_addr 
I eicjg^addr 


Status 


Physical address for most recent RBus access 
to the PQ Bus. 




Status 


Address for most recent IBus access to the 
Generic Bus. 


eicjg_data 


Status 


Address for most recent RBus access to the 
Generic Bus. 1 
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NAME 


TYPE 


DESCRIPTION 


Local Memory Controller Registers 


lmi_cfg 


Control2 


General configuration register 


lmi_sts 


Status 


General status register 


lmi_err_int 


Interrupt 


Error and interrupt status register 


lmi_errjnt_en 


Control 


Error and interrupt enable register 


lmi_dcfg 


Control2 


DRAM configuration register 


Imijnode 


Control2 


SDRAM mode register 


Peripheral Interface Controller Registers 


pic_cfg 


Oonfig2 


Configuration 


pic_stat 


Status 


Status 


pic_err_int 


Interrupt 


Interrupt/Error Status 


pic_eirjnt_en 


Config2 


Interrupt/Error Enable 


pic_abus_cfg 


Ctatrol2 


Configuration and control for ABus 


pic_abus_addr 


Configl 


Start address for ABus transfer 


pic_cent_cfg 


Control2 


Configuration and control for Centronics 


pic_cent_dir 


Canfig2 


Centronics pin direct control register 


pic_reveise_cfg 


Control2 


Configuration and control for reverse (input) 
data transfers 


pic_timerO 


Configl 


Initial data timer value 


picjimerl 


Configl 


Subsequent data timer value 


Miscellaneous Module Registers 


mm cfg 


Config2 


Configuration Register 


mm_stat 


Status 


Status Register 


mm_err_int 


Interrupt 


Error and Interrupt Register 


mm_err_int_en 


Config2 


Error and Interrupt Masks 


mm_gefg 


Config2 


Global Configuration Register 


1 mm_diag 


Config 


Diagnostic Configuration Register 


1 mm _grst 


Config 


Global Reset Register 


1 mm gerr 


Canfig2 


Global Error Register 
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mm_j>exp 


Config2 


Global Exception Register 


mm_gint 


Config2 


Global Interrupt Register 


mm_active 


Status 


Global Active signals 
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NAME 


TYPE 


DESCRIPTION 


Instruction Controller Registers 


ic_efg 


Config2 


Configuration Register 


ic_stat 


Status/ 
Interrupt 


Status Register 


ic_err_int 


Interrupt 


Error and Intemipt Register (write to dear 
error and interrupt) 


ic_eir_int_ en 


Config2 


Error and Interrupt Enable Register 


tc_ipa 


Contrail 


A stream Instruction Pointer 


ic_tda 


Configl 


A stream Todo Register 


icjna 


Contrail 


A stream Finished Register 


ic inta 


Configl 


A stream Interrupt Register 


ic loa 


Status 


A stream Last Overlapped Instruction 
Sequence number 


icjpb 


Contioll 


B stream Instruction Pointer 


icjdb 


Configl 


B stream Todo Register 


ic_fhb 


Controll 


B stream Finished Register 


icjntb 


Configl 


B stream Interrupt Register 


ic_lob 


Status 


B stream Last Overlapped Instruction 
Sequence number 


ic_sema 


Status 


A stream Semaphore 


ic semb 


Status 


B stream Semaphore 


Data Cache Controller Registers 


dccjtfgl 


Config2 


DCC configuration 1 register 


decjstat 


Status 


state machine status bits 


dcc_err_int 


Status 


DCC error status register 


dcc_err_int_en 


Controll 


DOC error interrupt enable bits 


dcc_cfg2 


Control2 


DCC configuration 2 register | 


dcc_addr 


Configl 


Base address register for special address | 
modes. 


dcc_lvO 


Controll 

i 


" valid" bit status for lines 0 to 31 
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dccjvl 


Controll 


"valid" bit status for lines 32 to 63 


dccjv2 


Control 1 


"valid" bit status for lines 64 to 95 


dccjv3 


Controll 


"valid" bit status for lines 96 to 127 


dcc_raddrb 


Status 


Operand Organizer B request address 


dcc_raddrc 


Status 


Operand Organizer C request address 








' NAME 


TYPE 


DESCRIPTION 


dccjest 


Controll 


DCC test register 


Pixel Organizer Registers 


po_cfg 


Config2 


Configuration Register 


pojstat 


Status 


Status Register 


po_err_int 


Interrupt 


Error/Interrupt Status Register 


po_err_int_en 


Config2 


Error/Interrupt Enable Register 


podmr 


Config2 


Data Manipulation Register 


po__subst 


Config2 


Substitution Value Register 


pocdp 


Status 


Current Data Pointer 


po_len 


Controll 


Length Register 


po_said 


Controll 


Start Address or Immediate Data 


po_idr 


Control2 


Image Dimensions Register 


po_muv_valid 


Control2 


MUV valid bits 


po_jnuv 


Omfigl 


Base address of MUV RAM 


Operand Organizer B Registers 


oob_cfg 


Config2 


Configuration Register 


oobstat 


Status 


Status Register 


oob_err_jnt 


Interrupt 


Erroi/Intemipt Register 


oob_err_int_£n 


ConfigZ 


Error/Interrupt Enable Register 


oob_dmr 


Conllg2 


Data Manipulation Register 


I oob_subst 


Config2 


Substitution Value Register 


I oob__cdp 


Status 


Current Data Pointer 


I oobjen 


Controll 


Input Length Register 


| oob said 


Controll 


Operand Start Address 
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oobjile 


Contrail 


Tiling length/offset Register 


Operand Organizer C Registers 


ooc_cfg 


Config2 


Configuration Register 


oocjstat 


Status 


Status Register 


ooc_err_int 


Interrupt 


Error/Interrupt Register 


ooc_err_int_en 


Config2 


Error/Interrupt Enable Register 


oocclmr 


Config2 


Data Manipulation Register 


oocjsubst 


Config2 


Substitution Value Register 


ooc_cdp 


Status 


Current Data Pointer 


oocjen 


Contrail 


Input Length Register 


ooc_said 


Contrail 


Operand Start Address 




NAME 


TYPE 


DESCRIPTION 


oocjile 


Contrail 


Tiling length/offset Register 


JPEG Coder Register 


jccfg 


Config2 


configuration 


jcjstat 


Status 


status 


jc_err_int 


Interrupt 


error and interrupt status register 


jc_err_int_en 


Config2 


error and interrupt enable register 


jcrsi 


Configl 


restart interval 


jcjJecode 


Q)ntrol2 


decode of current instruction 


jc_res 


Controil 


residual value 


jc_table_sel 


Control2 


table selection from decoded instruction 


Main Data Path Register 


mdp_cfg 


Oonfig2 


configuration 


mdp_stat 


Status 


status 


mdp_err_int 


Interrupt 


error/interrupt 


mdp_errjnt_en 


Config2 


errotfinterrupt enable 


mdp_tcst 


Config2 


test modes 


mdp_opl 


Control2 


current operation 1 


mdp_op2 


Control 


current operation 2 
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mdp_por 


Conlroll 


offset for plus operator 


mdp__bi 


Controll 


blend start/offset to index table entry 


mdp_bm 


Control 1 


blend end or number of rows and columns in 
matrix, binary places, and number of levels 
in halftoning 


mdpjen 


Controll 


Length of blend to produce 


Result Organizer Register 


ro_cfg 


Config2 


Configuration Register 


ro_stat 


Status 


Status Register 


ro_err_int 


Interrupt 


Error/Interrupt Register 


ro_errjnt_en 


Config2 


Erroi/Interrupt Enable Register 


ro_dmr 


Config2 


Data Manipulation Register 


II ro_subst 


Configl 


Substitution Value Register 


j ro_cdp 


Status 


Current Data Pointer 


rojen 


Status 


Output Length Register 


ro_sa 


Configl 


Start Address 


ro_idr 


Configl 


Image Dimensions Register 




NAME 


TYPE 


DESCRIPTION 


ro_vbasc 


Configl 


co-processor Virtual Base Address 


ro_cut 


Configl 


Output Cut Register 


rojmt 


Configl 


Output Length Limit 


PCIBus Configuration Space alias 






A read only copy of PCI configuration space 
registers 0x0 to OxD and OxF. 


pci_extetnaljc% 


Status 


32-bit field downloaded at reset from an 
external serial ROM. Has no influence on 
coprocessor operation. 


| Input Inter! 


ace Switch Registers 


I iis__cfg 


Config2 


Configuration Register 


[ iis_stat 


Status 


Status Register 
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iis__err_int 


Interrupt 


Interrupt/Error Status Register 


ns_err_int_en 


Config2 


Interrupt/Error Enable Register 


iis_ic_addr 


Status 


Input address from IC 


iis_doc_addr 


Status 


Input address from DCC 


iis_po_addr 


Status 


Input address from PO 


iis_burst 


Status 


Burst Length from PO, DCC & IC 


iis_base_addr 


Configl 


Base address of co-pioccssor memory object 
in host memory map. 


iisjest 


Configl 


Test mode register 



The more notable ones of these registers include: 

(a) Instruction Pointer Registers (ic_ipa and ic_ipb). This pair of registers 
each contains the virtual address of the currently executing instruction. Instructions are 

5 fetched from a<jcending virtual addresses and executed. Jump instruction can be used to 
transfer control across non-contiguous virtual addresses. Associated with each 
instruction is a 32 bit sequence number which increments by one per instruction. The 
sequence numbers are used by both the co-processor 224 and by the host CPU 202 to 
synchronize instruction generation and execution. 

10 (b) Finished Registers (icjna and icjhb). This pair of registers each 

contains a sequence number counting completed instructions. 

(c) Todo Register (icjda and icjdb). This pair of registers each contains 
a sequence number counting queued instructions. 

(d) Interrupt Register (icjnta and icjntb). This pair of registers each 
13 contains a sequence number at which lo interrupt. 

(e) Interrupt Status Registers (icjrtatajmmed and ic_stot.bjwimed). 
This pair of registers each contains a primed bit which is a flag enabling the interrupt 
following a match of the Interrupt and Finished Registers. This bit appears alongside 
other interrupt enable bits and other status/configuration information in the Interrupt 

20 Status (ic_stat) register. 

(f) Register Access Semaphores (ic_sema and icjsemb). The host CPU 
202 must obtain this semaphore before attempting register accesses to the co-processor 
224 that requires atomicity, ie. more than one register write. Any register accesses not 
requiring atomicity can be performed at any time. A side effect of the host CPU 202 
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obtaining this semaphore is that co-processor execution pauses once the currently 
executing instruction has compbtcd, The Register Access Semaphore is implemented as 
one bit of the configuration/status register of the co-processor 224. These registers are 
stored in the Instruction Controllers own register area. As noted previously, each sub- 
5 module of the co-processor has its own set of configuration and status registers. These 
registers are set in the course of regular instruction execution. All of these registers 
appear in the register map and many are modified implicitly as part of instruction 
execution. These are all visible to the host via the register map. 
3.4 Format of Plural Streams 

1 o As noted previously, the co-processor 224, in order to maximize the utilization of 

its resources and to provide for rapid output on any external peripheral device, executes 
one of two independent instruction streams. Typically, one instruction stream is 
associated with a current output page required by an output device in a timely manner, 
while the second instruction stream utilizes the modules of the co-processor 224 when the 

15 other instruction stream is dormant Clearly, the overriding imperatives are to provide 
the required output data in a timely manner whilst simultaneously attempting to maximize 
the use of resources for the preparation of subsequent pages, bands, etc. The co- 
processor 224 is therefore designed to execute two completely independent but identically 
implemented instruction streams (hereafter termed A and B). The instructions are 

20 preferably generated by software running on the host CPU 202 (Fig. 1) and forwarded to 
the raster image acceleration card 220 for execution by the co-processor 224. One of the 
instruction streams (stream A) operates at a higher priority than the other instruction 
stream (stream B) during normal operation. The stream or queue of instructions is 
written into a buffer or list of buffers within the host RAM 203 (Fig. 1) by the host CPU 

25 202. The buffers are allocated at start-up time and locked into the physical memory of 
the host 203 for the duration of the application. Each instruction is preferably stored in 
the virtual memory environment of the host RAM 203 and the raster image co-processor 
224 utilizes a virtual to physical address translation scheme to determine a corresponding 
physical address with the in-host RAM 203 for the location of a next instruction. These 

30 instructions may alternatively be stored in the co-processors 224 local memory. 

Turning now to Fig. 8, there is illustrated the format of two instruction streams A 



and B 270, 271 which are stored within the host RAM 203. The format of each of the 
streams A and B is substantially identical. 




I 

I 
1 
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Briefly, the execution model for the co-processor 224 consists of: 

♦ Two virtual streams of instructions, the A stream and the B stream. 
In general only one instruction is executed at a time. 

♦ Either stream can have priority, or priority can be by way of "round 

5 robin". 

♦ Either stream can be 'locked 1 ' in, ie. guaranteed to be executed 
regardless of stream priorities or availability of instructions on the other stream. 

♦ Either stream can be empty. 

♦ Either stream can be disabled. 

ID • Either stream can contain instructions that can be "overlapped", ie. 

execution of the instruction can be overlapped with that of the following instruction if the 
following instruction is not also "overlapped". 

Each instruction has a "unique" 32 bit incrementing sequence number. 

♦ Each instruction can be coded to cause an interrupt, and/or a pause in 
15 instruction execution, 

♦ Instructions can be speculatively prefetched to minimize the impact of 
external interface latency. 

Hie instruction controller 235 is responsible for implementing the co-processor's 
instruction execution model maintaining overall executive control of the co-processor 224 
20 and fetching instructions from the host RAM 203 when required. On a per instruction 
basis, the instruction controller 235 carries out the instruction decoding and configures the 
various registers within the modules via CBus 231 to force the corresponding modules to 
1 carry-out that instruction. 

Turning now to Fig. 9, there is illustrated a simplified form of the instruction 
25 execution cycle carried out by the instructions controller 235. The instruction execution 
cycle consists of four main stages 276-279, The first stage 276 is to determine if an 
instruction is pending on any instruction stream. If this is the case, an instruction is 
fetched 277, decoded and executed 278 by means of updating registers 279. 
3.5 Determine Current Active Stream 
30 In implementing the first stage 276, there are two steps which must be taken: 



1 . Determine whether an instruction is pending; and 
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In determining whether instructions are pending the following possible 
conditions must be examined: 



1. 



whether the instruction controller is enabled; 



2. 



whether the instruction controller is paused due to an internal error or 



interrupt; 



3. 



whether there is any external error condition pending; 
whether cither of the A or B streams are locked; 
whether either stream sequence numbering is enabled; and 
whether either stream contains a pending instruction. 



4. 



5. 



6. 



The following pseudo code describes the algorithm for determining whether an 
instruction is pending in accordance with the above rules. This algorithm can be 
hardware implemented via a state transition machine within the instruction controller 235 
in known manner 

if not error and enabled and not bypassed and not self test mode 
if A stream locked and not paused 

if A stream enabled and (A stream 
sequencing disabled or instruction on A stream) 

instruction pending 
else 

no instruction pending 
end if 



if B stream enabled and (B stream 
sequencing disabled or instruction on B stream) 

instruction pending 
else 

no instruction pending 
end if 



if (A stream enabled and not paused and (A 
stream sequencing disabled or instruction on A stream)) 



else 



ifB stream locked and not paused 



else 



/* no stream is locked */ 
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or (B stream enabled and not paused and 
(B stream sequencing disabled or instruction on B stream)) 

instruction pending 
else 

5 no instruction pending 

end if 

end if 

else /* interface controller not enabled */ 

no instruction pending 

10 end if 

If no instruction is found pending, then the instruction controller 235 will "spin" 
or idle until a pending instruction is found. 

To determine which stream is "active", and which stream is executed next, the 
15 following possible conditions are examined: 

1. whether either stream is locked; 

2. what priority is given to the A and B streams and what the last 
instruction stream was; 

3. whether either stream is enabled; and 

20 4. whether either stream contains a pending instruction. 

The following pseudo code implemented by the instruction controller describes 
how to determine the next active instruction stream: 

if A stream locked 
25 next stream is A 

else ifB stream locked 

next stream is B 
else /* no stream is locked */ 

if (A stream enabled and (A stream sequencing disabled or instruction 
30 on A stream)) and not (B stream enabled and (B stream sequencing disabled or 
instruction on B stream)) 
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else if (B stream enabled and (B stream sequencing disabled or 
instruction on B stream)) and not (A stream enabled and (A stream sequencing disabled 
or instruction on A stream)) 

next stream isB 

5 else i* both stream have instruction *l 

ijpri = 0 t*A high, B low */ 

next stream is A 
else ifpri = 1 I* A low, B high */ 
next stream is B 

10 else ifpri = 2 or 3 /* round robin */ 

if last stream is A 

next stream is B 

else 

next stream is A 
15 end if 

end if 

end if 

end if 

20 As the conditions can be constantly changing, all conditions must be determined 

together atornically. 

3*6 Fetch Instruction of Current Active Stream 

After the next active instruction stream is determined, the Instruction Controller 
235 fetches the instruction using the address in the corresponding instruction pointer 
25 register (ic_ipa or icjpb). However, the Instruction Controller 235 does not fetch an 
instruction if a valid instruction already exists in a prefetch buffer stored within the 
instruction controller 235, 

A valid instruction is in the prefetch buffer if: 

1. the prefetch buffer is valid; and 
30 2. the instruction in the prefetch buffer is from the same stream as the 

currently active stream. 

The validity of the contents of the prefetch buffer is indicated by a prefetch bit in 
the ic_stat register, which is set on a successful instruction prefetch. Any external write 
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to any of the registers of the instruction controller 235 causes the contents of the prefetch 

buffer to be invalidated. 

3.7 Decode and Execute Instruction 

Once an instruction has been fetched and accepted the instruction controller 235 
5 decodes it and configures the registers 229 of the co-processor 224 to execute the 
instruction. 

The instruction format utilized by the raster image co-processor 224 differs from 
traditional processor instruction sets in that the instruction generation must be carried out 
instruction by instruction by the host CPU 202 and as such is a direct overhead for the 

10 host. Further, the instructions should be as small as possible as they must be stored in 
host RAM 203 and transferred over the PCI bus 2D6 of Fig. 1 to the co-processor 224. 
Preferably, the co-processor 224 can be set up for operation with only one instruction. 
As much flexibility as possible should be maintained by the instruction set to maximize 
the scope of any future changes. Further, preferably any instruction executed by the co- 

15 processor 224 applies to a long stream of operand data to thereby achieve best 
performance. The co processor 224 employs an instruction decoding philosophy 
designed to facilitate simple and fast decoding for "typical instructions" yet still enable the 
host system to apply a finer control over the operation of the co-prooessor 224 for 
"atypical" operatioas. 

20 Turning now to Fig. 10, there is illustrated the format of a single instruction 280 

which comprizes eight words each of 32 bits. Each instruction includes an instruction 
word or opcode 281, and an operand or result type data word 282 setting out the format of 
the operands. The addresses 283-285 of three operands A, B and C are also provided, in 
addition to a result address 286. Further, an area 287 is provided for use by the host 

25 CPU 202 for storing infonnation relevant to the instruction. 

The structure 290 of an instruction opcode 281 of an instruction is illustrated in 
Fig. 11. The instruction opcode is 32 bits long and includes a major opcode 291, a 
minor opcode 292, an interrupt (I) bit 293, a partial decode (Pd) bit 294, a register Length 
(R) bit 295, a lode (L) bit 296 and a length 297. A description of the fields in the 

30 instruction word 290 is as provided by the following table. 



Table 2: Opcode Description 
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major opcode [3..0] 


Instruction category 




0: 


Reserved 




1: 


General Colour Space Conversion 




2: 


JPEG Compression and Decompression 




3: 


Matrix Multiplication 




4: 


Image Convolutions 




5: 


Image Transformations 




6: 


Data Coding 




1: 


Halftone 




8: 


Hierarchial image decompression 




9: 


Memory Copy 




10: Internal Register and Memory Access 




11: 


Instruction Flow Control 




12: Compositing 




13: Compositing 




14: Reserved 




15: Reserved 


minor opcode 


Instruction detail. The coding of this field is dependent 


[7^0] 


on the major opcode. 


I 


1 = 


Interrupt and pause when competed, 




0 = 


Don't interrupt and pause when completed 



| Field 


Description 


pd 


Partial Decode 

1» use the "partial decode" mechanism. 

0 - Don't use the "partial decode" mechanism 


R 


1 = length of instruction is specified by the Pixel 
Organizer's input length register (po_lcn) 
0 - length of instruction is specified by the opcode 
length field. 


L 


1 = this instruction stream (A or B) is "locked" for 
the next instruction. 
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0 = this instruction stream (A or B) is not "locked 11 
in for the next instruction. 


length [15..0] 


number of data items to read or generate 



By way of discussion of the various fields of an opcode, by setting the I-bit field 
293, the instruction can be coded such that instruction execution sets an interrupt and 
pause on completion of that instruction. This interrupt is called an "instruction 
5 completed interrupt". The partial decode bit 294 provides for a partial decode 
mechanism such that when the bit is set and also enabled in the ic_cfg register, the various 
modules can be micro coded prior to the execution of the instruction in a manner which 
will be explained in more detail hereinafter. The lock bit 296 can be utilized for 
operations which require more than one instruction to set up. This can involve setting 

10 various registers prior to an instruction and provides the ability to "lock" in the current 
instruction stream for the next instruction. When the L-bit 296 is set, once an instruction 
is completed, the next instruction is fetched from the same stream. The length field 297 
has a natural definition for each instruction and is defined in terms of the number of 
!, input data items" or the number of "output data items" as required. The length field 297 

15 is only 16 bits long. For instructions operating on a stream of input data items greater 
than 64,000 items the R-bit 295 can be set, in which case the input length is taken from a 
pojen register within the pixel organizer 246 of Fig. 2. This register is set immediately 
before such an instruction. 

Returning to Fig. 10, the number of operands 283-286 required for a given 

20 instruction varies somewhat depending on the type of instruction utilized. The following 
table sets out the number of operands and length definition for each instruction type: 
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Table 3: Operand Types 



Instruction 
Class 


Length defined by 


#of 
operands 


Cdmpositing 


input 


pixels 


3 


Genera] Color Space Conversion 


input 


pixels 


2 


JPEG decompression/compression 


input 


bytes 


2 


other decompression/compression 


input 


bytes 


2 


Image Transformations and 
Convolutions 


output 


bytes 


2 


Matrix Multiplication 


input 


pixels 


2 


Halftoning 


input 


pixels, bytes 


2 


Memory Copying 


input 


pixels, bytes 


1 


Hierarchial Image Decompression 


input 


pixels, bytes 


1 or 2 


Flow Control 


fixed 


fixed 


2 


Internal Access Instructions 


fixed 


fixed 


4 



Turning now to Fig. 12, there is illustrated, firstly, the data word format 300 of 
the data word or operand descriptor 282 of Fig. 10 for three operand instructions and, 
5 secondly, the data word format 301 for two operand instructions. The details of the 
encoding of the operand descriptors are provided in the following table: 



Table 4: Operand Descriptors 



Field 


Description 


what 


0 = instruction specific mode: 

This indicates that the remaining fields of the descriptor will be 
interpreted in line with the major opcode. Instruction specific 
modes supported are: 
major opcode «= 0-11: Reserved 

major opcode = 12-13: (Compositing): Implies that Operand C is a 
bitmap attenuation. The occ_dmr register will be set 
appropriately, with the cc=l and normalized 
major opcode = 1 4-15: Reserved j 
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1 = sequential addressing 




2 = tile addressing 




3 - constant data 

~ , ™J 
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Description 


L 


0 = not long: immediate data 

1 = long: pointer to data 


if 


in tenia! format: 

0 = pixels 

1 = unpacked bytes 

2 = packed bytes 

3 = other 


S 


0 = set up Dala Manipulation Register as appropriate for this 
operand 

1 = use the Data Manipulation Register as is 


C 


0 = not cacheable 

1 = cacheable 

Note: In general a performance gain will be achieved if an 
operand is specified as cacheable. Even operands displaying low 
Levels of referencing locality (such as sequential data) still benefit 
from being cached - as it allows data to be burst transferred to the 
host processor and is more efficient 


P 


external format: 

0 = unpacked bytes 

1 = packed stream 


bo[2:0] 


bit offset. Specifies the offset within a byte of the start of bitwize 
data. 


R 


0 = Operand C does not describe a register to set. 

1 = Operand C describes a register to set 

This bit is only relevant for instructions with less than three 
operands. 



With reference to the above table, it should be noted that, firstly, in respect of the 
constant data addressing mode, the co-processor 224 is set up to fetch, or otherwize 
calculate, one internal data item, and use this item for the length of the instruction for that 

- - * i - i ■ , ■ : vi ; 3 i i I 
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operand. In the tile addressing mode, the co-processor 224 is set up to cycle through a 
small set of data producing a "tiling effect". When the L-bit of an operand descriptor is 
zero then the data is immediate, ie. the data items appear literally in the operand word. 

Returning again to Fig. 10, each of the operand and result words 283-286 
5 contains either the value of the operand itself or a 32-bit virtual address to the start of the 
operand or result where data is to be found or stored. 

The instruction controller 235 of Fig. 2 proceeds to decode the instruction in two 
stages. It first checks to see whether the major opcode of the instruction is valid, raising 
an error if the major opcode 291 (Fig. 11) is invalid. Next, the instruction is executed by 
10 the instruction controller 235 by means of setting the various registers via CBus 231 to 
reflect the operation specified by the instruction. Some instructions can require no 
registers to be set. 

The registers for each module can be classified into types based on their behavior. 
Firstly, there is the status register type which is "read only" by other modules and 

15 "read/write" by the module including the register. Next, a first type of configuration 
register, hereinafter called T, u>nfigr', is "read/write" externally by the modules and "read 
only" by the module including the register. These registers are normally used for 
holding larger type configuration information, such as address values. A second type of 
configuration register, herein known as f, config2 n , is readable and writable by any module 

20 but is read only by the module including the register* This type of register is utilized 
where bit by bit addressing of the register is required. 

A number of control type registers are provided. A first type, hereinafter known 
as "con troll" registers, is readable and writable by all modules (including the module 
which includes the register). The controll registers are utilized for holding large control 

25 information such as address values. Analogously, there is further provided a second type 
of control register, hereinafter known as "controE", which can be set on a bit by bit basis* 
A final type of register known as an interrupt register has bits within the register 
which are settable to 1 by the module including the register and resettable to zero 
externally by writing a "1" to the bit that has been set This type of register is utilized for 

30 dealing with the interrupts/errors flagged by each of the modules. 

Each of the modules of the co-processor 224 sets a c.active line on the CBus 231 
when it is busy executing an instructioa The instruction controller 235 can then 
determine when instructions have been completed by "OR-ing 1, the c_active lines coming 
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from each of the modules over the CBiis 231. The local memory controller module 236 
and the peripheral interface controller module 237 are able to execute overlapped 
instructions and include a c_background line which is activated when they are executing 
an overlapped instruction. The overlapped instructions are "local DMA" instructions 
5 transferring daia between the I . . A memory interface and the peripheral interface. 

The execution cycle for an overlapped local DMA instruction is slightly different 
from the execution cycle of other instructions. If an overlapped instruction is 
encountered for execution, the instruction controller 235 checks whether there is already 
an overlapped instruction executing. If there is, or overlapping is disabled, the 

10 instruction controller 235 waits for that instruction to finish before proceeding with 
execution of that instruction. If there is not, and overlapping is enabled, the instruction 
controller 235 immediately decodes the overlapped instruction and configures the 
peripheral interface controller 237 and local memory controller 236 to carry put the 
instruction. After the register configuration is completed, the instruction controller 235 

15 then goes on to update its registers (including finished register, status register, instruction 
pointer, etc.) without waiting for the instruction to "complete" in the conventional sense. 
At this moment, if the finished sequence number equals the interrupt sequence number, 
'the overlapped instruction completed 1 interrupt is primed rather than raising the interrupt 
immediately. The 'overlapped instruction completed' interrupt is raized when the 

20 overlapped instruction has fully completed. 

Once the instruction has been decoded, the instruction controller attempts to 
prefetch the next instruction while the current instruction is executing. Most instructions 
take considerably longer to execute than they will io fetch and decode. The instruction 
controller 235 prefetches an instruction if all of the following conditions are met: 

25 1. the currently executing instruction is not set to interrupt and pause; 

2. the currently executing instruction is not a jump instruction; 

3. the next instruction stream is prefetch-enabled; and 

4. there is another instruction pending. 

If the instruction controller 235 determines that prefetching is possible it requests 



30 the next instruction, places it in a prefetch buffer and then validates the buffer. At this 
point there is nothing more for the instruction controller 235 to do until the currently 
executing instruction has completed* The instruction controller 235 determines the 
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completion of an instruction by examining the c_active and c_background lines associated 
with theCBus 23L 

3.8 Update Registers of Instruction Controller 

Upon completion of an instruction, the instruction controller 235 updates its 
5 registers to reflect the new state. This must be done atomically to avoid problems with 
synchronising with possible external accesses. This atomic update process involves; 

1. Obtaining the appropriate Register Access Semaphore, If the 
semaphore is taken by an agent external to the Instruction Controller 235, the instruction 
execution cycle waits at this point for the semaphore to be released before proceeding. 
ID 2. Updating the appropriate registers. The instruction pointer (ic_ipa or 

icjpb) is incremented by the size of an instruction, unless the instruction was a successful 
jump, in which case the target value of the jump is loaded into the instruction pointer. 

The finished register (icjha or ic_fnb), is then incremented if sequence 
numbering is enabled. 

15 The status register (ic_stat) is also updated appropriately to reflect the new state. 

This includes setting the pause bits if necessary. The Instruction Controller 235 pauses 
if an interrupt has occurred and pausing is enabled for that interrupt or if any error has 
occurred. Pausing is implemented by setting the instruction stream pause bits in the 
status register (a pause or b_pausc bits in ic_stat). To resume instruction execution, 

29 these bits should be reset to 0. 

3. Assorting a c_end signal on the CBus 231 for one clock cycle, which 
indicates to other modules in the co-processor 224 that an instruction has been completed. 

4. Raising an interrupt if required An interrupt is raized if: 

a. "Sequence number completed" interrupt occurs. That is, if the finished 
25 register (ic_fna or icjhb) sequence number is the same as interrupt sequence number. 

Then this interrupt is primed, sequence numbering is enabled, and the interrupt occurs; or 

b. the just completed instruction was coded to interrupt on completion, 
then this mechanism is enabled. 

3.9 Semantics of the Register Access Semaphore 

30 The Register Access Semaphore is a mechanism that provides atomic accesses to 
multiple instruction controller registers. Hie registers that can require atomic access are 
as follows: 

i. Instruction pointer register (icjpa and icjpb) . j > 
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2. Todo registers (icjda and ic_tdb) 

3. Finished registers (r fna and ic_fnb) 

4. Interrupt registers (ic_inta and icjntb) 

5. The pause bits in the configuration register (ic_cfg) 

5 Extern . ;-e t *is can read af registers safely ar - :ime. F.xternai *;;ents 

able to write any registers at any time, however to ensure that the Instruction Controller 
235 does not update values in these registers, the external agent must first obtain the 
Register Access Semaphore. The Instruction Controller does not attempt to update any 
values in the abovementioned registers if the Register Access Semaphore is claimed 

10 externally. The instruction controller 235 updates all of the above mentioned registers in 
one clock cycle to ensure atomicity. 

As mentioned above, unless the mechanism is disabled, each instruction has 
associated with it a 32 bit "sequence number". Instruction sequence numbers increment 
wrapping through from QxFFFFFFFF to QxOOOOODOO. 

15 When an external write is made into one of the Interrupt Registers (icjnta or 

icjntb), the instruction controller 235 immediately makes the following comparisons and 
updates: 

1. If the interrupt sequence number (ie, the value in the Interrupt Register) 
is "greater 11 (in a modulo sense) than the finished sequence number (ie. the value m the 

20 Finished Register) of the same stream, the instruction controller primes the "sequence 
number completed" interrupt mechanism by setting the "sequence number completed" 
primed bit (a_primed or b_primed bit in ic_stat) in the status register. 

2. If the interrupt sequence number is not "greater" than the finished 
sequence number, but there is an overlapped instruction in progress in that stream and the 

25 interrupt sequence number equals the last overlapped instruction sequence number (ie. the 
value in the icjoa or icjob register), then the instruction controller primes the 
"overlapped instruction sequence number completed" interrupt mechanism by setting the 
a_ol_pritned or bjol_primed bits in the ic_stat register, 

3. If the interrupt sequence number is not "greater" than the finished 
30 sequence number, and there is an overlapped instruction in progress in that stream, but the 

interrupt sequence number does not equal the last overlapped instruction sequence 
number, then the interrupt sequence number represents a finished instruction, and no 
interrupt mechanism is primed. 
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4. If the interrupt sequence number is not "greater" than the finished 
sequence number, and there is no overlapped instruction in progress in that stream, then 
the interrupt sequence number must represent a finished instruction, and no interrupt 
mechanism is primed, 

5 External agents can set any of the interrupt primed bits (bits ajrimed, 

a_6l_primed, b_primed or bj>l_primed) in the status register to activate or de-activatc 
this interrupt mechanism independently. 
3,10 Instruction Controller 

Turning now to Fig. 13, there is illustrated the instruction controller 235 in more 

10 detail. The instruction controller 235 includes an execution controller 305 which 
implements the instruction execution cycle as well as maintaining overall executive 
control of the co-processor 224. Hie functions of the execution controller 305 include 
maintaining overall executive control of the instruction controller 235, determining 
instructing sequencing, instigating instruction fetching and prefetching, initiating 

15 instructing decoding and updating the instruction controller registers. The instruction 
controller further includes an instruction decoder 306. The instruction decoder 306 
accepts instructions from a prefetch buffer controller 307 and decodes them according the 
aforementioned description. The instruction decoder 306 is responsible for configuring 
registers in the other co-processor modules to execute the instruction. The prefetch 

2U buffer controller 307 manages the reading and writing to a prefetch buffer within the 
prefetch buffer controller and manages the interfacing between the instruction decoder 
306 and the input interface switch 252 (Fig. 2). The prefetch buffer controller 307 is 
also responsible for managing the updating of the two instruction pointer registers (icjpa 
and icjpb). Access to the CBus 231 (Fig. 2) by the instruction controller 235, the 

25 miscellaneous module 239 (Fig. 2) and the external interface controller 238 (Fig. 2) is 
controlled by a "CBus" arbitrator 308 which arbitrates between the three modules 1 request 
for access. The requests are transferred by means of a control bus (CBus) 231 to the 
register units of the various modules. 

Turning now to Fig. 14, there is illustrated the execution controller 305 of Fig. 



30 13 in more detail. As noted previously, the execution controller is responsible for 
implementing the instruction execution cycle 275 of Fig. 9 and, in particular, is 
responsible for: ; , 4 ; 
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1. Determining which instruction stream the next instruction is to come 

from; 

2. Initiating fetching of that instruction; 

3. Signalling the instruction decoder to decode the instruction as residing 
5 in the prefetch buffer; 

4. Determining and initialing any prefetching of ihe next instruction; 

5. Determining instruction completion; and 

6. Updating the registers after the instruction has completed. 

The execution controller includes a large core state machine 310 hereinafter 

10 known as "the central brain" which implements the overall instruction execution cycle. 
Turning to Kg. 15, there is illustrated the state machine diagram for the central brain 310 
implementing Ihe instruction execution cycle as aforementioned. Returning to Fig. 14, 
the execution controller includes an instruction prefetch logic unit 311. This unit is 
responsible for determining whether there is an outstanding instruction to be executed and 

15 which instruction stream the instruction belongs to. The start 312 and prefetch 313 
states of the transition diagram of Fig. 15 utilize this information in obtaining instructions. 
A register management unit 317 of Fig. 14 is responsible for monitoring the register 
access semaphores on both instruction streams and updating all necessary registers in each 
module. The register management unit 317 is also responsible for comparing the 

20 finished register (icjiha or ic_fhb) with the interrupt register (ic_inta or ic_intb) to 
determine if a Sequence number completed" interrupt is due. The register management 
unit 317 is also responsible for interrupt priming. An overlapped instructions unit 318 is 
responsible for managing the finishing off of an overlapped instruction through 
management of the appropriate status bits in the ic_stat register. The execution 

25 controller also includes a decoder interface unit 319 for interfacing between the central 
brain 310 and the instruction decoder 306 of Fig. 13. 

Turning now to Fig. 16, there is illustrated the instruction decoder 306 in more 
detail. The instruction decoder is responsible for configuring the co-processor to execute 
the instructions residing in the prefetch buffer. The instruction decoder 306 includes an 

30 instruction decoder sequencer 321 which comprizes one large state machines broken 
down into many smaller state machines, The instruction sequencer 321 communicates 
with a CBus dispatcher 312 which is responsible for setting the registers within each 
module. The instruction decoder sequencer 321 also communicates relevant information 
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to the execution controller such as instruction validity and instruction overlap conditions. 
The instruction validity check being to check that the instruction opcode is not one of the 
reserved opcodes. 

Turning now to Fig. 17, there is illustrated, in more detail, the instruction 
5 dispatch sequencer 321 of Fig. 16. The instruction dispatch sequencer 321 includes a 
overall sequencing control state machine 324 and a series of per module configuration 
sequencer state machines, eg. 325, 326. One per module configuration sequencer state 
machine is provided for each module to be configured. Collectively the state machines 
implement the co-processor's microprogramming of the modules. The state machines, 

10 eg. 325, instruct the CBus dispatcher to utilize the global CBus to set various registers so 
as to configure the various modules for processing. A side effect of writing to particular 
registers is that the instruction execution commences. Instruction execution typically 
takes much longer than the time it takes for the sequencer 321 to configure the co- 
processor registers for execution. In appendix A, attached to the present specification, 

15 there is disclosed the microprogramming operations performed by the instruction 
sequencer of the co-processor in addition to the form of set up by the instruction 
sequencer 321. 

In practice, the Instruction Decode Sequencer 321 does not configure all of the 
modules within the co-processor for every instruction. The table below shows the 

20 ordering of module configuration for cadi class of instruction with the module configured 
including the pixel organizer 246 (PO), the data cache controller 240 (DCC), the operand 
organizer B 247 ((DOB), the operand organizer C 248 (OOQ, main data path 242 (MDP). 
results organizer 249 (RO), and JPEG encoder 241 (JC). Some of the modules are never 
configured during the course of instruction decoding. These modules are the External 

25 Interface Controller 238 (EIC), the Local Memory Controller 236 (LMC) 7 the Instruction 
Controller 235 itself (1C), the Input Interface Switch 252 (IIS) and the Miscellaneous 
Module (MM). 
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Table 5: Module Setup Order 



Instruction 
Class 


Module Configuration 
Sequence 


1 

Sequence 

ID 




PO, DCC, OOB, OOC, MDP, RO 


1 


CSC 


PO, DCC, OOB, OOC, MDP, RO 


2 


JPEG coding 


PO, DCC, OOB, OOC, JC, RO 


3 


Data coding 


PO, DCC, OOB, OOC, JC, RO 


3 


Transformations and 
Convolutions 


PO, DCC, OOB, OOC, MDP, RO 


2 


Matrix Multiplication 


PO, DCC, OOB, OOQ MDP, RO 


2 


Halftoning 


PO,DCC,OOB ) MDP,RO 


4 


General memory copy 


PO, JC RO 


8 


Peripheral DMA 


PIC 


5 


Hierarchial Image - 
Horizontal Interpolation 


PO, DCC, OOB, OOC, MDP, RO 


6 


Hierarchial Image - others 


PO, DCC, OOB, OOC, MDP, RO 


4 


Internal access 


RO, RO,RO,RO 


7 


others 







Turning now to Fig. 17, each of the module configuration sequencers, eg. 323 is 
responsible for carrying out the required register access operations to configure the 
particular module. The overall sequencing control state machine 324 is responsible for 
overall operation of the module configuration sequencer in the aforementioned order. 

Referring now to Fig. 18, there is illustrated 330 the state transition diagram for 
the overall sequencing control unit which basically activates the relevant module 
configuration sequencer in accordance with the above table. Each of the modules 
configuration sequencers is responsible for controlling the CBus dispatcher to alter 
register details in order to set the various registers in operation of the modules. 
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Turning now to Fig. 19, there is illustrated the prefetch buffer controller 307 of 
Fig. 13 in more detail. The prefetch buffer controller consists of a prefetch buffer 335 
for the storage of a single co-processor instruction (six times 32 bit words). The prefetch 
buffer includes one write port controlled by a IBus sequencer 336 and one read port which 

5 provides data to the instruction decoder, execution controller and the instruction controller 
CBus interface. The [Bus sequencer 336 is responsible for observing bus protocols in 
the connection of the prefetch buffer 335 to the input interface switch. An address 
manager unit 337 is also provided which deals with address generation for instruction 
fetching. The address manager unit 337 performs the functions of selecting one of 

10 icjpa or icjpb to place on the bus to the input interface switch, incrementing one of 
icjpa or ic_ipb based on which stream the last instructions was fetched from and 
channelling jump target addresses back to the ic ipa and icjpb register. A PBC 
controller 339 maintains overall control of the prefetched buffer controller 307. 
3.11 Description of a Modules Local Register File 

15 As illustrated in Fig- 13, each module, including the instruction controller 

module itself, has an internal set of registers 304 as previously defined in addition to a 
CBus interface controller 303 as illustrated in Fig. 20 and which is responsible for 
receiving CBus requests and updating internal registers in light of those requests. The 
module is controlled by writing registers 304 within the module via a CBus interface 302. 

20 A CBus arbitrator 308 (Fig. 13) is responsible for determining which module of the 
instruction controller 235, the external interface controller or the miscellaneous module is 
able to control the CBus 309 for acting as a master of the CBus and for the writing or 
reading of registers. 

Fig. 20, illustrates, in more detail, the standard structure of a CBus interface 303 
23 as utilized by each of the modules. The standard CBus interface 303 accepts read and 
write requests from the CBus 302 and includes a register file 304 which is utilized 341 
and updated on 341 by the various submodules within a module. Further, control lines 
344 arc provided for the updating of any suhmodule memory areas including reading of 
the memory areas. The standard CBus interface 303 acts as a destination on the CBus, 
30 accepting read and write requests for the register 304 and memory objects inside other 
submodules. 

A "cjresct" signal 345 sets every register inside the Standard CBus interface 103 
to their default states. However, h c_feset" will not reset the state machine that controls 
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the handshaking of signals between itself and the CBus Master, so even if "cjreset" is 
asserted in the middle of a CBus transaction, the trar«*::ction will still finish, with 
undefined effects. The "c_int" 347, "c_exp" 348 and "c_en" 349 signals are generated 
from the content of a modules errjnt and err_int_en registers by the following equations: 

c_err = ^ errorri] AND err_mask[i] (1) 

errorfi] not reserved 



c_ int - y interrupt[i] AND int_ mask[i] (2) 

interrBpll(jnot reserved 

10 

c_ exp « ^ exception[i] AND exp_ mask[i] (3) 

[i] net reserved 

The signals "c.sdatajn* 345 and n c_svalidjif are data and valid signals from the previous 
15 module in a daisy chain of modules. The signals w c_sdata_out" and lf c_£valid_Dut" 350 aie data and valid 
signals going to the next module in the daisy chain. 

The functionality of the Standard CBus interface 303 includes; 
1 „ register read/write handling 

2. memory area read/write handling 

20 3. test mode read/write handling 

4. submodule observe/update handling 

3.12 Register Read/Write Handling 

The Standard CBus Interface 303 accepts register read/write and bit set requests 
that appears on the CBus. There are two types of CBus instructions that Standard CBus 
25 Interface handles: 
1. Type A 

Type A operations allow other modules to read or write 1, 2, 3, or 4 bytes into 
any register inside Standard CBus Interface 303. For write operations, the data cycle 
occurs in the clock cycle immediately after the instruction cycle. Note that the type field 
30 for register write and read are "1000" and "1001" respectively. The Standard CBus 
Interface 303 decodes the instruction to check whether the instruction is addressed to the 
module, and whether it is a read or write operation. For read operation, the Standard 
CBus Interface 303 uses the "reg" field of the CBus transaction to select which register 
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output is to put into the !, c_sdata" bus 350. For write operations, the Standard CBus 
Interface 303 uses the "reg" and "byte" fields to write the data into the selected register. 
After read operation is completed, the Standard CBus Interface returns the data and 
asserts "c_svalid v 350 at the same time. After write operations are completed, the 
5 Standard CBus Interface 303 asserts "c_svalld H 350 to acknowledge. 
2. Type C 

Typ& C operations allow other modules to write one or more bits in one of the 
bytes in one of the registers. Instruction and data are packed into one word. 

The Standard CBus Interface 303 decodes the instruction to check whether the 
10 instruction is addressed to the module. It also decodes "reg", "byte" and "enable" fields 
to generate the required enable signals. It also latches the data field of the instruction, 
and distributes it to all four bytes of a word so the required bit(s) are written in every 
enabled bit(s) in every enabled byte(s). No acknowledgment is required for this 
operation. 

15 3.13 Memory Area Read/Write Handling 

The Standard CBus Interface 303 accepts memory read and memory write 
requests that appears on the CBus. While accepting a memory read/write request, the 
Standard CBus Interface 303 checks whether the request is addressed to the module. 
Then, by decoding the address field in the instruction, the Standard CBus Interface 

20 generates the appropriate address and address strobe signals 344 to the submodule which 
a memory read/write operation is addressed to. For write operations the Standard CBus 
Interface also passes on the byte enable signals from the instruction to the submodules. 

The operation of the standard CBus interface 303 is controlled by a read/write 
controller 352 which decodes the type Geld of a CBus instruction from the CBus 302 and 

25 generates the appropriate enable signals to the register file 304 and output selector 3S3 so 
that the data is latched on the next cycle into the register file 304 or forwarded to other 
submodules 344. If the CBus instruction is a register read operation, the read/write 
controller 352 enables the output selector 353 to select the correct register output going 
onto the "csdata bus" 345. If the instruction is a register write operation, the read/write 

30 controller 352 enables the register file 304 to select the data in the next cycle. If the 
instruction is a memory area read or write, then the read/write controller 352 generates the 
appropriate signals 344 to control those memory areas under a modules control. Hie 
register File 304 contains four parts, being a register select decoder 355, an output selector 
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353, interrupt 356, error 357 and exception 358 generators, unmasked error generator 359 

and ;he roaster components 360 which r.ake up the registers of that particular rrr-A:I* 

The register select decoder 355 decodes the signal "ref.en" (register Ble enable), "write" 

and "reg" from the read/write controller 352 and generates the register enable signals for 
5 ouubli-u the priwieular register .;f interest. Tht o^gui selector 353 se : .. the cor. '-*. 

register data to be output on c_sdata_out lines 350 for register read operations according 

to the signal "reg" output from the read/write controller 352, 

The exception generators 356-359 generate an output error signal, eg* 347-349, 

362 when an error is detected on their inputs. The formula for calculating each output 
10 error is as aforementioned. 

The register components 360 can be defined to be of a number of types in 

accordance with requirements as previously discussed when describing the structure of 

the register set with reference to Table 5. 

3.14 CBus Structure 
15 As noted previously, the CBus (control bus) is responsible for the overall control 

of each module by way transferring information for the setting of registers within each 

module's standard CBus interface. It will be evident from the description of the standard 

CBus interface that the CBus serves two main purposes: 

1. It is the control bus that drives each of the modules. 
20 2. It is the access bus for RAMs, FIFOs and status information contained 

within each of the modules. 

The CBus uses an instruction-address-data protocol to control modules by the 

setting configuration registers within the modules. In general, registers will be set on a 

per instruction basis but can be modified at any time. The CBus gathers status and other 
25 information, and accesses RAM and FIFO data from the various modules by requesting 

data. 

The CBus is driven on a transaction by transaction basis either by: 

1. the Instruction Controller 235 (Fig. 2) when executing instructions, 

2. the External Interface Controller 238 (Fig. 2) when performing a target 
30 (slave) mode bus operation, or 

3. an external device if the External CBus Interface is so configured. 
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In each of these cases, the driving module is considered to be the source module 
of the CBus, and all other modules possible destinations. Arbitration on this bus is 
carried out by the Instruction Controller. 

The following table sets out one form of CBus signal definitions suitable for use 
5 with the preferred embodiment: 



Table 6: CBus Signal Definition 



Name 


Type 


Definition 


c_iad[31:0] 


source 


instruction-address-data 


c_valid 


source 


CBus instruction valid 


n crtoto r ^ 1 *m 

C_J5Qata[0 1 .U J 


U&o LLUOUUli 


status/read data 


c_svalid 


destination 


status/read data valid 


c_reset[15:0] 


source 


reset lines to each module 


c_active[15:0] 


destination 


active lines from each 
module 


c_background[ 1 5:0] 


destination 


background active lines 
from each module 


cjnt[15:0] 


destination 


interrupt lines from each 
module 


c_error[l5:0] 


destination 


error lines from each 
module 


cjeql,creq2 


EIQ external 


bus control request 
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Name 


Type 


Definition ? 


c_gntl, c_gnt2 


IC 


bus control grant 


c_cnd 


IC 


end nf instruction { 


elk 


global 


clock j 



A CBus c_iad signal contains the addressing data and is driver* oy the controller 
in two distinct cycles: 

5 1. Instruction cycles (c_valid high) where the CBus instruction and an 

address is driven onto c_iad; and 

2. Data cycles (cjvalid low) where data is driven onto c_iad (write 
operations) or c_sdata (read operations). 

In the case of a write operation, the data associated with an instruction is placed 
to on the cjad bus in the cycle directly following the instruction cycle. In the case of a 
read operation, the target module of the read operation drives the c_sdata signal until the 
data cycle completes. 

Turning now to Fig. 21, the bus includes a 32 bit instruction-address-data field 
which can be one of three types 370-372: 
15 1 . Type A operations (370) arc used to read and write registers and the per- 

module data areas within the co-processor. These operations can be generated by the 
external interface controller 238 performing target mode PCI cycles, by the instruction 
controller 231 configuring the co-processor for an instruction, and by the External CBus 
Interface. 

20 For these operations, the data cycle occurs in the clock cycle immediately 

following the instruction cycle. Hie data cycle is acknowledged by the designation 
module using the c_svalid signal, 

2. Type B operations (371) are used for diagnostic purposes to access any 
local memory and to generate cycles on the Generic Interface. These operations will be 

25 generated by the External Interface Controller performing target mode PCI cycles and by 
the External CBus Interface. The data cycle can follow at any time after the instruction 
cycle. The data cycle is acknowledged by the destination module using the c_svalid 
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3. Type C operations (372) are used to set individual bits within a module's 
registers. These operations will be generated by the instruction controller 231 
configuring the co-processor's for an instruction and by the External CBus Interface, 
There is no data cycle associated with a Type C operation, data is encoded in the 
5 instruction cycle. 

The type field of each instruction encodes the relevant CBus transaction type in 
accordance with the following table: 



Table 7: CBus Transaction Types 



cjad-type 
value 


transaction type 


instruction 
format type 


0000 


no-op 


A, B, C 


0001 


reserved 




0010 


peripheral interface write 


B 


0011 


peripheral interface read 


B 


0100 


generic bus write 


B 


0101 


generic bus read 


B 


0110 


local memory write 


B 


0111 


local memory read 


B 


1000 


register write 


A 


1001 


register read 


A 


1010 


module memory write 


A 


1011 


module memory read 


A 


1100 


test mode write 


A 


1101 


test mode read 


A 


1110 


bit set 


C 


1111 


reserved 





The byte field is utilized for enabling bits within a register to be set. The 
module field sets out the particular module to which an instruction on the CBus is 
addressed. The register field sets out which of the registers within a module is to be 
updated. The address field is utilized for addressing memory portions where an 
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operation is desired on those memory portions and can be utilised for addressing RAMs, 
FIFOs., etc. The ernble field enables selected hiV " : fthin a selected byte when a bit set 
instruction is utilized. The data field contains the bit wize data of the bits to be written 
to the byte selected for update. 
5 As noted previously, the CBus includes a cjacri line for each module, which i 

asserted when ever a module has outstanding activity pending. The instruction 
controller utilises these signals to determine when an instruction has completed. Further, 
the CBus contains a c_background line for each module that can operate in a background 
mode in addition to any preset, error and interrupt lines, one for each module, for resetting, 

10 detecting errors and interrupts, 

3.15 Co-processor Data Types and Data Manipulation 

Returning now to Fig, 2, in order to substantially simplify the operation of the 
co-processor unit 224, and in particular the operation of the major computational units 
within the co-processor being the JPEG coder 241 and the main data path 242, the co- 

15 processor utilizes a data model that differentiates between external formats and internal 
formats. The external data formats are the formats of data as it appears on the co- 
processor's external interfaces such as the local memory interface or the PCI bus. 
Conversely, the internal data formats are the formats which appear between the main 
functional modules of the co-processor 224. This is illustrated schematically in Fig. 22 

20 which shows the various input and output formats. The input external format 381 is the 
format which is input to the pixel organizer 246, the operand organizer B 247 and the 
operand organizer C 248. These organizers are responsible for reformatting the input 
external format data into any of a number of input internal formats 382, which may be 
inputted to the JPEG coder unit 241 and the main data path unit 242. These two 

25 functional units output data in any of a number of output internal formats 383, which are 
converted by the results organizer 249 to any of a number of required output formats 304. 

In the embodiment shown, the external data formats can be divided into three 
types. The first type Is a n packed stream" of data which consists of a contiguous stream 
of data having up to four channels per data quantum, with each channel consisting of one, 

30 two, four, eight or sixteen bit samples. This packed stream can typically represent pixels, 
data to be turned into pixels, or a stream of packed bits. The cu-proccssor is designed to 
utilize little endian byte addressing and big endian bit addressing within a byte. In Fig. 
23, there is illustrated a first example 386 of the packed stream format. It is assumed 
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that each object 387 is made up of three channels being channel 0, channel 1 and channel 
2, with two bits per channel. The layout of data for this format is as indicated 388. In a 
next example 390 of Fig. 24, a four channel object 395 having eight bits per channel is 
illustrated 396 with each data object taking up a 32 bit word. In a third example 395 of 
5 Fig. 25, one channel objects 396 are illustrated which each take up eight bits per channel 
starting at a bit address 397. Naturally, the actual width and number of channels of data 
will vary depending upon the particular application involved. 

A second type of external data format is the "unpacked byte stream" which 
consists of a sequence of 32 bit words, exactly one byte within each word being valid. 
10 An example of this format is shown in Fig. 26 and designated 399, in which a single byte 
400 is utilized within each word, 

A further external data format is represented by the objects classified as an 
"other" format. Typically, these data objects are large table-type data representing 
information such as colour space conversion tables, Huffman coding tables and the like. 
15 The co-processor utilizes four different internal data types. A first type is 

known as a "packed bytes" format which comprizes 32 bit words, each consisting of four 
active bytes, except perhaps for a final 32 bit word. In Fig. 27, there is illustrated one 
particular example 402 of the packed byte format with 4 bytes per word. 

The next data type, illustrated with reference to Fig. 28, is "pixel" format and 
20 comprises 32 bit words 403, consisting of four active byte channels. This pixel format is 
interpreted as four channel data. 

A next internal data type illustrated with reference to Fig. 29 is an "unpacked 
byte" format, in which each word consists of one active byte channel 405 and three 
inactive byte channels, the active byte channel being the least significant byte. 
25 All other internal data objects are classified by the "other" data format, 

Input data in a given external format is converted to the appropriate internal 
format. Fig. 30 illustrates the possible conversions carried out by the various organizers 
from an external format 410 to an internal format 411. Similarly, Fig. 31 illustrates the 
conversions carried out by the results organizer 249 in the conversion from internal 
30 formats 412 to external formats 413. 

The circuitry to enable the following conversions to take place are described in 
greater detail below. ^ { 
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Turning firstly to the conversion of input data external formats to internal 
fcrardts, in Fig. 32 there is shown the methodology utilized by the various orf^eys in 
the conversion process. Starting initially with the external other format 416, thus is 
merely passed through the various organizers unchanged Next, the external unpacked 
5 fr^s format 417 undergoes unpacked normalization 418 to produce a format 419 known 
as internally unpacked bytes. The process of unpacked normalization 418 involves 
discarding the three inactive bytes from an externally unpacked byte stream. The 
process of unpacked normalization is illustrated in Fig. 33 wherein the input data 417 
having four byte channels wherein only one byte channel is valid results in the output 

10 format 419 which merely comprizes the bytes themselves. 

Turning again to Fig. 32, the process of packed normalization 421 involves 
translating each component object in an externally packed stream 422 into a byte stream 
423. If each component of a channel is less than a byte in size then the samples are 
interpolated up to eight bit values. For example, when translating four bit quantities to 

15 byte quantities, the four bit quantity OxN is translated to the byte value OxNN. Objects 
larger than one byte are truncated. The input object sizes supported on the stream 422 
are 1, 2, 4, 8 and 16 bit sizes, although again these may be different depending upon the 
total width of the data objects and words in any particular system to which the invention 
is applied. 

20 Turning now to Fig. 34, there is illustrated one form of packed normalization 421 

on input data 422 which is in the form of 3 channel objects with two bits per channel (as 
per the data format 386 of Fig. 23). The output data comprizes a byte channel format 
423 with each channel "interpolated up" where necessary to comprize an eight bit sample. 
Returning to Fig. 32* the pixel streams are then subjected to either a pack 

25 operation 425, an unpacked operation 426 or a component selection operation 427. 

In Fig. 35 there is shown an example of the packed operation 425 which simply 
involves discarding the inactive byte channel and producing a byte stream, packed up with 
four active bytes per word. Hence, a single valid byte stream 430 is compressed into a 
format 431 having four active bytes per word. The unpacking operation 426 involves 

30 almost the reverse of the packing operation with the unpacked bytes being placed in the 
least significant byte of a word. This is illustrated in Fig, 36 wherein a packed byte 
stream 433 is unpacked to produce result 434. 
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The process of component selection 427 is illustrated in Fig. 37 and involves 
selecting N components from an input stream, where N is the number of input channels 
per quantum. The unpacking process can be utilized to produce "prototype pixels" eg. 
437, with the pixel channels filled from the least significant byte. Turning to Fig. 38, 
5 there is illustrated an example of component selection 440 wherein input data in the form 
436 is transformed by the component selection unit 427 to produce prototype pixel format 
437. 

After component selection, a process of component substitution 440 (Fig. 32) 
can be utilized. The component substitution process 440 is illustrated in Fig, 38 and 

10 comprizes replacing selected components with a constant data value stored within an 
internal data register 441 to produce, as an example, output components 242. 

Returning again to Fig. 32, the output of stages 425, 426 and 440 is subjected to 
a lane swapping process 444. The lane swapping process, as illustrated in Fig. 39, 
involves a byte-wize multiplexing of any lane to any other lane, including the replication 

15 of a first lane onto a second lane. The particular example illustrated in Fig. 39 includes 
the replacement of channel 3 with channel 1 and the replication of channel 3 to channels 2 
and channel 1. 

Returning again to Fig. 32, after the lane swapping step 444 the data stream can 
be optionally stored in the multi-used value RAM 250 before being read back and 
20 subjected to a replication process 446. 

The replication process 446 simply replicates the data object whatever it may be. 
In Fig. 40, there is illustrated a process of replication 446 as applied to pixel data. In this 
case, the replication factor is one. 

In Fig. 41, there is illustrated a similar example of the process of replication 
23 applied to packed byte date. 

In Fig. 42, there is illustrated the process utilized by the result organizer 249 fcr 
transferral of data in an output internal format 383 to an output external format 384. 
This process includes equivalent steps 424, 42S, 426 and 440 to the conversion process 
described in Fig. 32 Additionally, the process 450 includes the steps of component 
30 deselection 451, denormalization 452, byte addressing 453 and write masking 454. The 
component deselection process 451, as illustrated in Fig. 43, is basically the inverse 
operation of the component selection process 427 of Fig. 37 and involves the discarding 
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of unwanted data. For example, in Fig. 43, only 3 valid channels of the input are taken 
■?.nd racked into data items -56. 

The denormalization process 452 is illustrated with reference to Fig. 44 and is 
loosely the inverse operation of the packed normalization process 421 of Fig, 34. The 
5 denormaiizati >n process involves the translation of each object or data item, previously 
treated as a byte 3 to a non-byte value. 

The byte addressing process 453 of Fig. 42 dez/s with any byte wize 
reorganization that is necessary to deal with byte addressing issues. For an externally 
unpacked byte output stream, the least two significant bits of the stream's address 
10 correspond to the active stream. The byte addressing step 453 is responsible for re- 
mapping the output stream from one byte channel to another when external unpacked 
bytes are utilized (Fig. 45). Where an externally packed stream is utilized (Fig. 46), the 
byte addressing module 453 remaps the start address of the output stream as illustrated. 

The write masks process 454 of Fig. 42 is illustrated in Fig. 47 and is used to 
15 mask off a particular channel eg. 460 of a packed stream which is not to be written out. 

The details of the input and output data type conversion to be applied are 
specified by the contents of the corresponding Data Manipulation Registers: 

* The Pixel Organizer Data Manipulation Register (po_dmr) 

* The Operand Organizer B and Operand Organizer C Data Manipulation 
20 Registers (oob_dmr» ooc_dmi); 

The Result Organizer Data Manipulation Register (ro_dmr); 
Each of the Data Manipulation Registers can be set up for an instruction in one 
of two ways: 

1. They can be explicitly set using any of the standard methods for writing 
25 to the co-processor's registers immediately prior to the execution of the instruction; or 

2. They can be set up by the co-processor itself to reflect a current 
instruction. 

During the instruction decoding process, the co-processor examines the contents 
of the Instruction Word and the Data Word of the instruction to determine, amongst other 
30 things, how to set up the various Data Manipulation Registers. Not all combinations of 
the instruction and operands make sense. Several instructions have implied formats for 
some operands. Instructions that are coded with inconsistent operands may complete 
without error, although any data so generated is "undefined". If the 'S' bit of the 
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corresponding Data Descriptor is 0, the co-processor sets the Data Manipulation Register 
to reflect the current instruction. 

The format of the Data Manipulation Registers is illustrated in Fig. 48. The 
following table sets out the format of the various bits within the registers as illustrated in 



i e . AS: 


Table 8: Data Manipulation Register Format 


Field 


Description 


ls3 


Lane Swap for byte 3 (most significant byte) 


ls2 


Lane swap for byte 2 


lsl 


Lane swap for byte 1 


IsO 


Lane swap for byte 0 


ouuw>ii 


Substitution Enables 

1 = substitute data from Internal Data Register for this byte 

0 = do nol substitute data from Internal Data Register for this byte 


replicate 


Replication Count 

Indicates the number of additional data items to generate. 


wrmask 


Write Masks 

0 = write out corresponding byte channel 

1 = do not write out corresponding byte channel 


cmsb 


Choose most significant bits 

0 = choose least significant bits of a byte when performing 
denormalization (useful for halftoning operations) 

1 = choose most significant bits of a byte when performing 
denormalization (useful as inverse of input normalization) 
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Field 


Description 


normalize 


Normalization factor: represents the number of bits to be 
translated to a byte: 

0 = 1 bit data objects 

1 = 2 bit dsta objects 

2 = 4 bit data objects 

3 = 8 bit data objects 

4 = 16 bit data objects 


bo 


Bit Offset: represents the starting bit address for objects smaller 
than a byte. Bit addressing is big endian. 


P 


External Format: 

0 = unpacked bytes 

1 = packed stream 


if 


Internal Format: 

0 = pixels 

1 = unpacked bytes 

2 = packed bytes 

3 = other 


cc 


Channel count: 

Fbr the Input Organizers this defines the number of normalized 
input bytes collected to form each internal data word during 
component selection. For the Output Organizer this defines the 
number of valid bytes from the internal data word that will be sued 
to construct output data. 

0 = 4 active channels 

1 = 1 active channels 
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I 

2 = 2 active channels 

3 = 3 active channels 


L 


Immediate data: 

0 - not long: immediate data 

1 = long: pointer to data 
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Field 


Description 


what 


addressing mode: 

0 ~ instruction specific ^odc 

1 = sequential addressing 

2 = tile addressing 

3 = constant data ie, one item of internal data is produced, and this 
item is used repetitively. 



A plurality of internal and external data types may be utilized with each 
instruction. All operand, results and instruction type combinations are potentially valid, 
5 although typically only a subset of those combinations will lead to meaningful results. 
Particular operand and result data types that are expected for each instruction are detailed 
below in a first table (Table 9) summarising the expected data types for external and 
internal formats: 

10 Table 9: Expected Data Types 



I Instruction 


Operand A 


Operands 


Operand C 


Result 




(Pixel 


(Operand 


(Operand 


(Result 




Organizer) 


Organiser B) 


Organizer C) 


Organizer) 


Compositing 


ps 


px 


ps 


p*cr> 


ps 


ub 


px 


ps 










bl(B) 


nb 




ub 


ub 












const 








GCSC 


ps 


ift 


mcsc 


mcsc 


mcsc 


mcsc 








ift 




scsc 


scsc 


scsc 


scsc 












(B) 


(B) 


(B) 


(B) 






JPEG camp. 


ps 


pb 


et 


et(B) 


et(B) 


et(B) 


ub 


ps 




us 




(B) 












JPEG decomp 


ps 


pb 


fdl 


fdt 


fdt 


fdt 


pb 


ps 








sdt 


sdL (B) 


sdt (B) 


sdt 




ub 
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(B) 






(B) 






Data coding 


ps 


px 


et 


et 


ct 


et 


px 


ps 




ub 


pb 


fdt 


fdt 


fdt 


fdt 


pb 


ub 






uu 


MJL 


out I -L* J 




sdt 


ub 










(B) 






(B) 






Transformations 


skd 


skd 


it(B) 


it(D) 


it(B) 


it(B) 


px 


ps 


and Convolutions 


lkd 


lkd 












ub 


Matrix 


ps 


px 


mm 


mm 


mm 


mm(B 


px 


ps 


Multiplication 


ub 




(B) 


(B) 


(B) 


) 




ub 



Instruction 


Operand A 


Operand B 


Operand C 


Result 




(Pixel 


(Operand 


(Operand 


(Result 




Organizer) 


Organizer B) 


Organizer C) 


Organizer) 


Halftoning 


ps 


px 


ps 


px 






px 


ps 




ub 


pb 


ub 


pb 






pb 


ub 






ub 




ub 






ub 




Hierarchial Image: 


ps 


px 










px 


ps 


horizontal 


ub 


pb 










pb 


ub 


interpolation 




ub 










ub 




Hierarchial Image: 


ps 


px 


ps 


px 






px 


PS 


vertical interpolation 


ub 


pb 


ub 


pb 






pb 


ub 


and residual merging 




ub 




ub 






ub 




General Memory 


PS 


P* 










px 


ps 


Copy 


ub 


pb 










pb 


ub 






ub 










ub 




Peripheral DMA 


















Internal Access 


















Flow Control 
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The symbols utilized in the above table are as follows: 



Table 10: Symbol Explanation 



Symbol 


Explanation 




packed stream 


pb 


packed bytes 


ub 


unpacked b ? tes 


P* 


pixels 


bl 


blend 


const 


constant 


mcsc 


4 output channel 


scsc 


1 output channel color conversion table 


if. 


Interval and fraction tables 


et 


JPEG encoding table 


fdt 


fast JPEG decoding table 


sdt 


slow JPEG decoding table 


skd 


short kernel descriptor 


lkd 


long kernel descriptor 


mm 


matrix co-efficient table 




Symbol 

It 


Explanation | 


image table 


(B) 


this organizer in bypass mode for this operation 


CO 


operand may tile 




no data flows via this operand | 



3.16 Data Normalization Circuit 

Referring to Fig. 49, there is shown a computer graphics processor having three 
main functional blocks: a data nonnalizer 1062 which may be implemented in each of 
the pixel organizer 246 and operand organizers B and C 247, 248, a central graphics 
engine in the form of the main data path 242 or JPEG nnits 241 and a programming agent 
1064, in the form of an instruction controller 235. The operation of the data nonnalizer 
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1062 and the central graphics engine 1064 is determined by an instruction stream 1066 
that is provided to the programming agent 1064. For each instruction, the programming 
agent 1064 performs a decoding function and outputs internal control signals 1067 and 
1068 to the other blocks in the system. For each input data word 1069, the normaiizer 

5 1062 v* ;-! format the data according to the current instruction ard pass the r.:sul! to the 
central graphics engine 1063, where further processing is performed. 

The data normaiizer represents, in a simplified form, the pixel organizer and the 
operand organizers B and C. Each of these organizers implements the data 
normalization circuitry, thereby enabling appropriate normalization of the input data prior 

10 to it passing to the central graphics engine in the form of the JPEG coder or the main data 
path. 

The central graphics engine 1063 operates on data that is in a standard format, 
which in this case is 32-bit pixels. The normaiizer is thus responsible for converting its 
input data to a 32-bit pixel format The input data words 1069 to the normaiizer are also 

1 5 32 bits wide, but may take the form of either packed components or unpacked bytes. A 
packed component input stream consists of consecutive data objects within a data word, 
the data objects being 1,2,4,8 or 16 bits wide. By contrast, an unpacked byte input 
stream consists of 32-bit words of which only one 8-bit byte is valid. Furthermore, the 
pixel data 11 produced by the normaiizer may consist of 1,2,3 or 4 valid channels, where 

20 a channel is defined as being 8 bits wide. 

Turning now to Fig. SO, there is illustrated in greater detail a particular hardware 
implementation of the data normaiizer 1062. The data normalization unit 1062 is 
composed of the following circuits: a First-In-First-Out buffer (FIFO) 1073, a 32-bit input 
register (REG1) 1074, a 32-bit output register (REG2) 1076, normalization multiplexors 

25 1075 and a control unit 1076. Each input data word 1069 is stored in (he FIFO 1073 and 
is subsequently latched into REG1 1074, where it remains until all its input bits have been 
converted into the desired output format. The normalization multiplexors 1075 consist 
of 32 combinatorial switches that produce pixels to be latched into REG2 by selecting bits 
from the value in REG1 1074 and the current output of the FIFO 1073. Thus the 

30 normalization multiplexors 1075 receive two 32-bit input words 1077, 1078, denoted as 
x[63..32j and x[31..0]. 



It has been found that such a method improves the overall throughput of the 
apparatus, especially when the FIFO contains at least two valid data words during the 
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course of an instruction. This is typically due to the way in which data words originally 
fetched from memory. In some cases, a desired data word or object may ;■ ^ r pre,jc- 
across or "wrapped" into a pair of adjacent input data words in the FIFO buffer. By 
using an additional input register 1074, the normalization multiplexers can reassemble a 
5 com.p'cte input data word using components from adjacent data words in the ft po buffer 
thereby avoiding need for additional storage or bit-stripping operations prior to the main 
n- : 3 r < t * * ;<%n _ ctg . This ;:~nri*errv:?:r particularly advantageous v r here multiple 
data words ot a similar type are inputted to the normalizer. 

The control unit generates enable signals REG1_EN 20 and REG2_EN[3..0] 

10 1081 for updating REG1 1074 and REG2 1076, respectively, as well as signals to control 
the FIFO 1073 and normalization multiplexors 1075. 

The programming agent 1064 in Fig. 49 provides the following configuration 
signals for the data normalizer 1062: a FIFO_WR 4 signal, a normalization factor n[2..0], 
a bit offset b[2..0], a channel count c[l~0] and an external format (E)« Input data is written 

15 into the FIFO 1073 by asserting the FIFO„WR signal 1085 for each clock cycle that valid 
data is present. The FIFO asserts a fifbjull status flag 1086 when there is no space 
available. Given 32-bit input data, the external format signal is used to determine 
whether the input is in the format of a packed stream (when E=l) or consists of unpacked 
bytes (when E=0). For the case when E=l, the normalization factor encodes the size of 

20 each component of a packed stream, namely: n=0 denotes 1-bit wide components, n=l 
denotes 2 hits per component, n=2 denotes 4 bits per component, n=3 denotes 8-bit wide 
components and n>3 denotes 16-bit wide components. The channel count encodes the 
maximum number of consecutive input objects to format per clock cyde in order to 
produce pixels with the desired number of val id bytes. In particular, c=l yields pixels with 

25 only the least significant byte valid, c=2 denotes least significant 2 bytes valid, c=3 
denotes least significant 3 bytes valid and c=0 denotes all 4 bytes valid 

When a packed stream consists of components that are less than 8 bits wide, the 
bit offset determines the position in x[31..0], the value stored in REG1, from which to 
begin processing data. Assuming a bit offset relative to the most significant bit of the first 

30 input byte, the method for producing an output data byte y[7..0] is described by the 
following set of equations :- 

where n = 0 : . ■ , 
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y[i] = x[7-b], 



84 



where 0 i <= 7 



where n = 1 : 
yp] = xTO 

y{i] = x[6-b], 



where 
where 



i= 1,3,5,7 
i = 0,2,4,6 



10 



15 



where n = 2 : 

y[3] = x[7-b] 

y[2] = x[6-b] 

y[l] = x[5-b] 

y[0] = x[4-b] 

y[7] = y[3] 

y[6] =y[2] 

y[51 =yfi] 

y[4] =y[0] 



where n = 3 : 



where 



<= i <= 7 



20 



where n > 3 : 

y[7..0] = x[15..8] 



Corresponding equations may be used to generate output data bytes y[l5.&), 
y[23„16] and y[31.24]. 

25 The above method may be generalized to produce an output array of any length 

by taking each component of the input stream and replicating it as many times as 
necessary to generate output objects of standard width. In addition, the order of 
processing each input component may be defined as little-endian or big-endian. The 
above example deals with big-endian component ordering since processing always begins 

30 from the most significant bit of an input byte. Little-endian ordering requires 
redefinition of the bit offset to be relative to the least significant bit of an input byte. In 
situations where the input component width exceeds the standard output width, output 
components arc generated by truncating each input component, typically by removing a 
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suitable number of the least significant bits. In the above set of equations, truncation of 
16-b?r input components to form 8-bit wide standard output is performed by selecting the 
most significant byte of each 16-bit data object. 

The control unit of Fig. 50 performs the decoding of n[2„0] and c[L.0], and uses 
5 the r... - :ong with ""?~.Q] and E to provide the select signals for the normaliza*. . 
multiplexors and the enable signals for REG1 and REG2. Since the FIFO may become 

f-'^r ^ - ^g the cour. r '- : -- ■' i -"r^;on, * r ' -y^-ril unit * -r.->- ■■- .- 

record the current bit position, in_bit[4..0], in REG1 from which to select input data, and 
the current byte, out_byte[1..0], in REG2 to begin writing output data. The control unit 

10 detects whea it has completed processing each input word by comparing the value of 
in_bit[4..0] to the position of the final object in REG1, and initiates a FIFO read operation 
by asserting the FTFOJRD signal for one clock cycle when the FIFO is not empty. The 
signals fifo_empty and fifo_full denote the FIFO status flags, such that fifo_empty=l 
when the FIFO contains no valid data, and fifo_full=l when the FIFO is full. In the same 

15 clock cycle that FIPO_RD is asserted, REG1_EN is asserted so that new data are captured 
into REG1. There are 4 enable signals for REG2, one for each byte in the output register. 
The control unit calculates REG2_EN[3..0] by taking the minimum of the following 3 
values : the decoded version of c[1..0], the number of valid components remaining to be 
processed in REG1, and the number of unused channels in REG2. When E=0 there is 

20 only one valid component in REG1. A complete output word is available when the 
number of channels that have been filled in REG2 is equal to the decoded version of 
c[1..0]. 

In a particularly preferred embodiment of the invention, the circuit area occupied 
by the apparatus in Fig. 50 can be substantially reduced by applying a truncation function 
25 to the bit offset parameter, such that only a restricted set of offsets axe used by the control 
unit and normalization multiplexors. The offset truncation depends upon the 
normalization factor and operates according to the following equation: 
b_trunc[2. .0] = 0, where n >= 3 

= b[2..0], where n = 0 
30 =b[2..1]&"0", wheren = l 

= b[2]&'W, where n = 2 
(Note that denotes bitwize concatenation). 
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The above method allows each of the normalization multiplexors, denoted in Fig. 
50 by MUXO, MUX1 ... MUX31, to be reduced from 32-to-l in size when no truncation 
is applied, to be a maximum size of 20-to-l with bit offset truncation. The size reduction 
in turn leads to an improvement in circuit speed. 
5 It can be seen from the foregoing that the preferred embodiment provides an 

efficient circuit for the transformation of data into one of a few normalized forms. 
3.17 Image Processing Operations of Accelerator Card 

Returning again to Fig. 2 and Table 2, as noted previously, the instruction 
controller 235 "executes" instructions which result in actions being performed by the oo- 

10 processor 224. The instructions executed include a number of instructions for the 
performance of useful functions by the main data path unit 242. A first of these useful 
instructions is compositing. 
3.17.1 Compositing 

Referring now to Fig. 51, there is illustrated the compositing model implemented 

15 by the main data path unit 242. The compositing model 462 generally has three input 
sources of data and the output data or sink 463. The input sources can firstly include 
pixel data 464 from the same destination within the memory as the output 463 is to be 
written to. The instruction operands 465 can be utilized as a data source which includes 
the color and opacity information. The color and opacity can be either flat, a blend, 

20 pixels or tiled. The flat or blend is generated by the blend generator 467, as it is quicker 
to generate them internally than to fetch via input/output. Additionally, the input data 
can include attenuation data 466 which attenuates the operand data 465. The attenuation 
can be flat, bit map or a byte map. 

As noted previously, pixel data normally consists of four channels with each 

25 channel being one byte wide. The opacity channel is considered to be the byte of highest 
address. For an introduction to the operation and usefulness of compositing operations, 
reference is made to the standard texts including the seminal paper by Thomas Porter and 
Tom Duff "Compositing Digital Images' 1 in Computer Graphics, Volume 18, Number 3, 
July 1984. 

30 The co-processor can utilize pre-multiplied data. Pre-multiplication can consist 

of pre-multiplying each of the colored channels by the opacity channel. Hence, two 
optional pre-multiplication units 468, 469 are provided for pre-multiplying the opacity 
Channel 470, 471 by the colored data to form, where required, pre-multiplied outputs 472, 
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473. A compositing unil 475 implements a composite of its two inputs in accordance 
with the current instruction data. The compositing operators axe illustrated in Table 11 
below: 

5 TV . 11: ?omposlting Operations 



Operator 


Definition 


-o) over "^"co^o) 




(s C o> a o) ™ (b C o> b o) 


(acoBo^o) 


(*co a o) out ( b co b o) 


(acoCl-boJ^Cl-bo)) 


(^o^o) ato P fccoA)) 


(a C0 b 0 +b C0 (l-a o ),b o ) 




Operator 


Definition 


(*Wkj) OT <bco- b o) 


ao)) 


( a co> a o) P lus ( b co> b o) 


(wcfaco+bco-^ao+bQ- 

255)/255)+r(clainp(a 0 +b 0 )- 

255)/255,clamp(a 0 +b 0 )) 


(flco^) loadzero (b^.b^ 


(0,0) 


(a^^JloadcCbco^o) 


0>co>*o) 


(*cq>*q) loado fcco^o) 


( a cc b o) 


0w» u ) loadco 0>co' b o) 





The nomenclature (a^, a^ refers to a pie-multiplied pixel of color Bq and 
opacity a^. R is an offset value and "wc r is a wrapping/clamping operator whose 
10 operation is explained below. It should be noted that the reverse operation of each 
operator in the above table is also implemented by a composting unit 475. 

A clamp/wrapping unit 476 is provided to clamp or wrap data around the limit 
values 0-255, Further, the data can be subjected to an optional "unpre-multiplication" 
477 restoring the original pixel values as required. Finally, output data 463 is produced 
15 for return to the memory. 
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In Fig. 52, there is illustrated the form of an instruction word directed to the main 
data path unit for composting operations. When the X field in the major op-code is 1, 
this indicates a plus operator is to be applied in accordance with the aforementioned table. 
When this field is 0, another instruction apart from the plus operator is to be applied. 
5 The P a field determines whether or not to pre-muitiply the first data stream 464 (Fig. 51). 

The P^ field determines whether or not to pre-multiply the second data stream 465. The 

P r field determines whether or not to "unpre-multiply" t!\,, : iv u'iilisk.g unit 477. The 

C field determines whether to wrap or clamp, overflow or underflow in the range 0-255. 
The "com-code" field determines which operator is to be applied. The plus operator 

10 optionally utilizes an offset register (mdp jror). This offset is subtracted from the result 
of the plus operation before wrapping or damping is applied. For plus operators, the 
corn-code field is interpreted as a per channel enablement of the offset register. 

The standard instruction word encoding 280 of Fig. 10 previously discussed is 
altered for composting operands. As the output data destination is the same as the 

15 source, operand A will always be the same operand as the result word so operand A can 
be utilized in conjunction with operand B to describe at greater length the operand B. As 
with other instructions, the A descriptor within the instructions still describes the format 
of the input and the R descriptor defines the format of the output. Turning now to Fig. 
53, there is illustrated in a first example 470, the instruction word format of a blend 

20 instruction. A blend is defined to have a start 471 and end value 472 for each channel. 
Similarly, in Fig. 54 there is illustrated 475 the format of a tile instruction which is 
defined by a tile address 476 a start offset 477, a length 478. All tile addresses and 
dimensions are specified in bytes. Tiling is applied in a modular fashion and, in Fig. 55, 
there is shown the interpretation of the fields 476 - 478 of Fig. 54. Hie tile address 476 

25 denotes the start address in memory of the tile. A tile start offset 477 designates the first 
byte to be utilized as a start of the tile. The tile length 478 designates the total length of 
the tile for wrap around. 

Returning to Fig. 51, every color component and opacity can be attenuated by an 
attenuation value 466. Hie attenuation value can be supplied in one of three ways: 



30 1. Software can specify a flat attenuation by placing the attenuation factor 

in the operand C word of the instruction. 
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2. A bit map attenuation where 1 means fully on and 0 means fully off can 

be uli1r: x:d with software specifying the address of the bit map in ihu: operand C word of 
the instruction. 

3. Alternatively, a byte map attenuation can be provided again with the 

5 accrc of the byl - -p in operand C. 

Since the attenuation is interpreted as an unsigned integer from 0-255, the pre- 
mulliplicd color channel is multiplied by the attenuation factor by effectively calculating: 

c oa = c oa xA / 255 

Where A is the attenuation and Cq is the pre-multiplied color channel. 

i o 3.1 7.2 Color Space Conversion Instructions 

Returning again to Fig. 2 and Table 2, the main data path unit 242 and data cache 
230 are also primarily responsible for color conversion. The color space conversion 
involves the conversion of a pixel stream in a first color space format, for example 
suitable for ROD color display, to a second color space format, for example suitable for 

15 CYM or CYMK printing. The color space conversion is designed to work for all color 
spaces and can be used for any function from at least one to one or more dimensions. 

The instruction controller 235 configures, via the Cbus 231, the main data path 
unit 242, the data cache controller 240, the input interface switch 252, the pixel organizer 
246, the MUV buffer 250, the operand organizer B 247, the operand organizer C 248 and 

20 the result organizer 249 to operate in the color conversion mode. In this mode, an input 
image consisting of a plurality of lines of pixels is supplied, one line of pixels after 
another, to the main data path unit 242 as a stream of pixels. The main data path unit 
242 (Fig. 2) receives the stream of pixels from the input interface switch 252 via the pixel 
organizer 246 for color space conversion processing one pixel at a time* In addition, 

25 interval and fractional tables are pre-loaded into the MUV buffer 250 and color 
conversion tables are loaded into the data cache 230. The main data path unit 242 
accesses these tables via the operand organizers B and C, and converts these pixels, for 
example from the RGB color space to the CYM or CYMK color space and supplies the 
converted pixels to the result organizer 249. The main data path unit 242, the data cache 

30 230, the data controller 240 and the other abovementioned devices are able to operate in 
either of the following two modes under control of the instruction controller 235; a Single 
Output General Color Space (SOGCS) Conversion mode or a Multiple Output General 
Color Space (MOGCS) Conversion Mode. For more details on the data cache controller 
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240 and data cache 230, reference is made to the section entitled Data Cache Controller 
and Cache 240, 230 (Fig. 2). 

Accurate color space conversion can be a highly non-linear process. For 
example, color space conversion of a RGB pixel to a single primary color component (e.g. 
5 cyan) of the CYMK color space is theoretically linear, however in practice non-linearities 
are introduced typically by the output device which is used to display the colour 
components of the pixel. Similarly for the color space conversion of the- RGB pixel to 
the other primary color components (yellow, magenta or black) of the CYMK color space* 
Consequently a non-linear colour space conversion is typically used to compensate for the 
10 non-linearities introduced on each colour component. The highly non-linear nature of 
| the color conversion process requires either a complex, transfer function to be 

! implemented or a look-up table to be utilized. Given an input color space of, for 

example, 24 bit RGB pixels, a look-up table mapping each of these pixels to a single 8 bit 
primaiy color component of the CYMK color space (i.e. cyan) would require over 16 
15 megabytes. Similarly, a look-up table simultaneously mapping the 24 bit RGB pixels to 

i 

all four 8 bit primary color components of the CYMK color space would require over 64 
megabytes, which is obviously excessive. Instead, the main data path 242 (Fig. 2) uses a 
look-up table stored in the data cache 230 having sparsely located output color values 
corresponding to points in the input color space and interpolates between the output color 

20 values to obtain an intermediate output. 

a. Single Output General Color Space (SOGCS) Conversion Mode 

In both the single and multiple output color conversion modes (SOGCS) and 
(MOGCS), the RGB color space is comprized of 24 bit pixels having 8 bit red, green and 
blue color components. Each of the RGB dimensions of the RGB color space is divided 

25 into 15 intervals with the length of each interval having a substantially inverse 
proportionality to the non-linear behavior of the transfer function between the RGB to 
CYMK color space of the printer. That is, where the transfer function has a highly non- 
linear behavior the interval size is reduced and where the transfer function has a more 
linear behavior, the size of the interval is increased. Preferably, the color space of each 

30 output printer is accurately measured to determine those non-linear portions of its transfer 
function. However, the transfer function can be approximated or modelled based on 
know-how or measured characteristics of a type printer (e.g.: ink-jet). For each color 
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15 intervals. Two tables are used by the main data path unit 242 to determine which 
interval a particular input color component value lies within and also to determine a 
fraction along the interval in which a particular input color component value lies. Of 
course, different tables may be used for output printers having different transfer functions. 

5 As noted previously, each uf (he RGB dimensions is divided into 15 interval 

In this way the RGB color space forms a 3-dimcnsional lattice of intervals and the input 
pixels at the ends of the intervals form sparsely located points in the input color space. 
Further, only the output color values of the ouiput color space corresponding to mc 
endpoints of the intervals are stored in look-up tables. Hence, an output color value of 

10 an input color pixel can be calculated by determining the output color values 
corresponding to the endpoints of the intervals within which the input pixel lies and 
interpolating such output color values utilising the fractional values. This technique 
reduces the need for large memory storage, 

Turning now to Fig. 56, there is illustrated 480 an example of determining for a 

15 particular input RGB color pixel, the corresponding interval and fractional values. The 
conversion process relies upon the utilization of an interval tabic 482 and a fractional 
table 483 for each 8 bit input color channel of the 24 bit input pixel. The 8 bit input 
color component 481, shown in a binary form in Fig. 56 having the example decimal 
number 4, is utilized as a look-up to each of the interval and fractional tables. Hence, 

20 the number of entries in each table is 256. The interval table 482 provides a 4 bit output 
defining one of the intervals numbered 0 to 14 into which Che input color component 
value 481 falls. Similarly, the fractional table 483 indicates the fraction within an 
interval that the input color value component 481 falls. The fractional table stores 8 bit 
values in the range of 0 to 255 which arc interpreted as a fraction of 256. Hence, for an 

25 input color value component 481 having a binary equivalent to the decimal value 4, this 
value is utilized to look-up the interval table 482 to produce an output value of 0. The 
input value 4 f s also utilized to look-up the fractional table 483 to produce an output value 
of 160 which designates the fraction 160/256. As can be seen from the interval and 
fractional tables 482 and 483, the interval lengths are not equal. As noted previously, 

30 the length of the intervals are chosen according to the non-linear behavior of the transfer 
function. 

As mentioned above, the separate interval and fractional tables are utilized for 
each of the RGB color components resulting in three interval outputs and three fractional 
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outputs. Each of the interval and fractional tables for each color component are loaded 
in the MUV buffer 250 (Fig. 2) and accessed by the main data path unit 242 when 
required. The arrangement of the MUV buffer 250 for the color conversion process is as 
shown in Fig. 57. The MUV buffer 250 (Fig. 57) is divided into three areas 488, 489 

5 and 490, one area for each color component. Each area e.g. 488 is further divided into a 
4 bit interval table and a 8 bit fractional table. A 12 bit output 492 is retrieved by the 
main data path unit 242 from the MUV buffer 250 for each input color channel. In the 
example given above of a single input color component having a decimal value 4, the 12 
bit output will be 000001010000. 

10 Turning now to Fig. 58, there is illustrated an example of the interpolation 

process. The interpolation process consists primarily of interpolation from one three 
dimensional space 500, for example RGB color space to an alternative color space, for 
example CMY or CMYK. The pixels P0 to P7 form sparsely located points in the RGB 
input color space and having corresponding output color values CV(P0) to CV(P7) in the 

15 output color space. The output color component value corresponding to the input pixel 
Pi falling between the pixels P0 to P7 is determined by; firstly, determining the endpoints 
PO, P1....JP7 of the intervals surrounding the input pixel Pi; secondly, determining the 
fractional components frac_r, frac_g and frac_b; and lastly interpolating between the 
output color values CV(P0) to CV(P7) corresponding to the endpoints PO to P7 using the 

20 fractional components. 

The interpolation process includes a one dimensional interpolation in the red (R) 
direction to calculate the values temp 11, temp 12, temp 13, temp 14 in accordance with 
the following equations: 

25 temp 11 = CV(P0) + fiacr (CV(P1) - CV(PO)) 

temp 12 = CV(P2) + fracr (CV(P3) - CV(P2)) 
temp 13 = CV(P4) 4- fracr (CV(P5) - CV(P4)) 
temp 14 = CV(P6) + fracr (CV(P7) - CV(P6)) 

30 Next, the interpolation process includes the calculation of a further one dimensional 
interpolation in the green (G) direction utilising the following equations to calculate the 
values temp 21 and temp 22; 
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temp 21 = temp 11 + frac_g (temp 12 - temp 11) 
temp 22 = temp 13 -f frac_g (temp 14 - temp 13) 

Finally, the final dimension interpolation in the blue (B) direction is carried out 

5 to Cilculai. i.aal color ok; value accordant- with the following equation. 

fina! = te\np 21 + f rag__b (temp 22 - temp 21) 

Unfortunately, it is often the case that the input and output gamut may not match. 
In this respect, the output gamut may be more restricted that the input gamut and in this 
case, it is often necessaiy to clamp the gamut at the extremes. This often produces 
unwanted artefacts when converting using the boundary gamut colors. An example of 
how this problem can occur will now be explained with reference to Fig. 59, which 
represents a one dimensional mapping of input gamut values to output gamut values. It 
is assumed that output values are defined for the input values at points 510 and 511. 
However, if the greatest output value is clamped at the point 512 then the point 511 must 
have an output value of this magnitude. Hence, when interpolating between the two 
points 510 and 511, the line 515 forms the interpolation line and the input point 516 
produces a corresponding output value 517. However, this may not be the best color 
mapping, especially where, without the gamut limitations, the output value would have 
been at the point 518. The interpolation line between 510 and 518 would produce an 
output value of 519 for the input point 516. The difference between the two output 
values 517 and 519 can often lead to unsightly artefacts, particularly when printing edge 
of gamut colors. To overcome this problem, the main data path unit can optionally 
calculate in in expanded output color space and then scale and clamp to the appropriate 
range utilising the following formula: 

0 if *c63 

out = 2(x-64) if (64sx<191) (4) 

30 255 if <192<x) 

Returning now to Fig, 58, it will be evident that the interpolation process can 
either be carried out in the SOCGS conversion mode which converts RGB pixels to a 
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single output color component (for example, cyan) or the MOGCS mode which converts 
RGB pixels to all the output color components simultaneously. Where color conversion 
is to be carried out for each pixel in an image, many millions of pixels may have to be 
independently color converted. Hence, in order for high speed operation, it is desirable 
to be able to rapidly locate the 8 values (P0-P7) around a particular input value. 

As noted previously with respect to Fig. 57, the main data path unit 242 retrieves 
for each color input channel, a 12 bit output consisting of a 4 bit interval part and a 8 bit 
fractional part. The main data path unit 242 concatenates these 4 bit interval parts of the 
red, green and blue color channels to form a single 12 bit address (Ir, Iq, Ib)> ^ shown in 
Fig. 60 as 520. 

Fig. 60 shows a data flow diagram illustrating the manner in which a single 
output color component 563 is obtained in response to the single 12 bit address 520. 
The 12 bit address 520 is first fed to an address generator of the data cache controller 240, 
such as the generator 1881 (shown in Fig. 141) which generates 8 different 9 bit line and 
byte addresses 521 for memory banks (Br> Bi,...B7)- The data cache 230 (Fig. 2) is 
divided into 8 independent memory banks 522 which can be independently addressed by 
the respective 8 line and byte addresses. The 12 bit address 520 is mapped by the 
address generator into the 8 line and byte addresses in accordance with the following 
table: 



TaUe 12: Address Composition for SOGCS Mode 





Bit [8:6] 


Bit [5:3] 


Bit [2:0] 


Bank 7 


R[3:l] 


G(3:l] 


B[3:l] 


Bank 6 


R[3:l] 


G[3:l] 


B[3:l]+B[0] 


BankS 


R[3:l] 


G[3:l]+G[0] 


B[3:l] 


Bank 4 


R[3:l] 


G[3:l]+G[0] 


B[3:l]+B[0] 


Bank 3 


R[3:l]+R[0] 


G(3:l] 


B[3:l] 


Bank 2 


R[3:l]+R[0] 


G[3:l] 


B[3:l]+B[0] 


Bank 1 


R[3:l]+R[0] 


G[3:l]+G[0] 


B[3:l] 


BankO 


R[3:l]+R[0] 


G[3:l]+G[0] 


B[3:l]+B[0] 
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where BrT[8:6], BIT[5:3] and BIT[2:0] represent the sixth to eighth bits, the 
third to fifth bits and the zerc to second bite of the 9 bit bank addresses respectively; and 

where R[3:l], G[3:l] and B[3:l] represent the fust to third hits of the 4 bit 
intervals Ir, Iq and lg of the 12 bit address 520 respectively. 
5 Reference is made to memory bank 5 of Table 12 for a more detailed explam'tl.: 

of the 12 bit to 9 bit mapping. In this particular case, the bits 1 to 3 of the 4 bit red 
interval l r of the 12 bit address 520 are mapped to bits 6 to 8 of the 9 bit address B5; bits 
1 to 3 and bit 0 of the 4 bit green interval Ig are summed and then mapped to bits 3 to 5 of 
the 9 bit address B5; and bits 1 to 3 of the 4 bit blue interval I5 are mapped to bits 0 to 2 

10 of the 9 bit address B5. 

Each of the 8 different line and byte addresses 521 is utilized to address a 
respective memory bank 522 which consists of 512 x 8 bit entries, and the corresponding 
8 bit output color component 523 is latched for each of the memory banks 522, As a 
consequence of this addressing method, the output color values of CV(P0) to CV(P7) 

15 correseponding to the endpoints P0 to P7 may be located at different positions in the 
memory banks. For example, a 12 bit address of 0000 0000 0000 will result in the same 
bank address for each bank, ic 000 000 000, However a 12 bit address of 0000 0000 
0001 will result in different bank addresses, ie a bank address of 000 000 000 for banks 7, 
5, 3 and 1 and a bank address of 000 000 001 for banks 6, 4, 2 and 0. It is in this way the 

20 eight single output color values CV(PO)-CV(P7) surrounding a particular input pixel 
value are simultaneously retrieved from respective memory banks and duplication of 
output color values in the memory banks can be avoided. 

Turning now to Fig. 61, there is illustrated the structure of a single memory bank 
of the data cache 230 when utilized in the single color conversion mode. Each memory 

25 bank consists of 128 line entiles 531 which are 32 bits long and comprize 4x8 bit 
memories 533 - 536. The top 7 bits of the memory address 521 are utilized to determine 
the corresponding row of data within the memory address to latch 542 as the memory 
bank output Hie bottom two bite are a byte address and are utilized as an input to 
multiplexer 543 to determine which of the 4x8 bit entries should be chosen 544 for output. 



30 One data item is output for each of the 8 memory banks per clock cycle for return to the 
main data path unit 242. Hence, the data cache controller receives a 12 bit byte address 
from the operand organizer 248 (Fig. 2) and outputs in return to the operand organizers 
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247, 248, the 8 output color values for interpolation calculation by the main data path unit 

242. 

Returning to Fig. 60, the interpolation equations are implemented by the main 
data path unit 242 (Fig, 2) in three stages. In the main data path unit, a first stage of 

5 multiplier and adder units eg. 550 which take as input the relevant color values output by 
the corresponding memory banks eg. 522 in addition to the red fractional component 551 
and calculate the 4 output values In accordance with stage 1 of the abovementkmed 
equations. The outputs eg. 553, 554 of this stage are fed to a next stage unit 556 which 
utilizes the fracjg input 557 to calculate an output 558 in accordance with the 

10 aforementioned equation for stage 2 of the interpolation process. Finally, the output 558 
in addition to other outputs eg. 559 of this stage are utilized 560 in addition to the fracj) 
input 562 to calculate a final output color 563 in accordance with the aforementioned 
equations. 

The process illustiated in Fig. 60 is implemented in a pipelined manner so as to 

15 ensure maximum overall throughput. Further, the method of Fig. 60 is utilized when a 
single output color component 563 is required. For example, the method of Fig. 60 can 
be utilized to first produce the cyan color components of an output image followed by the 
magenta, yellow and black components of an output image reloading the cache tables 
between passes. This is particularly suitable for a four-pass printing process which 

20 requires each of the output colors as part of separate pass, 
b. Multiple Output General Color Space Mode 

The co-processor 224 operates in the MOGCS mode in a substantially similar 
manner to the SOCGS mode, with a number of notable exceptions. In the MOGCS 
mode, the main data path unit 242, the data cache controller 240 and data cache of Fig. 2 

25 co-operate to produce multiple color outputs simultaneously with four primary colors 
components being output simultaneously. This would require the data cache 230 to be 
four times larger in size. However, In the MOGCS mode of operation, in order to save 
storage space, the data cache controller 240 stores only one quarter of all the output color 
values of the output color space. The remaining output color values of the output color 

30 space are stored in a low speed external memory and arc retrieved as required. This 
particular apparatus and method is based upon the surprising revelation that the 
implementation of sparsely located color conversion tables in a cache system have an 
extremely low miss rate. This is based on the insight there is a low deviation in color 
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values from one pixel to the next in most color images. In addition, there is a high 

probability the sparsely located oiuput color values will be the same for neighbor: r<~ 
pixels. 

Turning now to Fig. 62 there will now be described the method carried out by the 

5 co-proc . or to implement multi-channel cached color conversion. Each input pixel ; 
broken into its color components and a corresponding interval tabic value (Fig. 56) is 
determined as previously described resulting in the three 4 bit intervals Ir, fg, Fb denoted 
570. The combined 12 bit number 570 is utilized in conjunction with the 
aforementioned tabic 12 to again derive eight 9-bit addresses. The addresses eg. 572 are 

10 then re-mapped as will be discussed below with reference to Fig. 63, and then are utilized 
to look up a corresponding memory bank 573 to produce four colour output channels 574. 
The memory bank 573 stores 128x32 bit entries out of a total possible 512x32 bit entries. 
The memory bank 573 forms part of the data cache 230 (Fig. 2) and is utilized as a cache 
as will now be described with reference to Fig. 63. 

15 Turning to Fig, 63, the 9 bit bank input 578 is re-mapped as 579 so as to anti- 

alias memory patterns by re-ordering the bits 580-582 as illustrated. This reduces the 
likelihood of neighboring pixel values aliasing to the same cache elements. 

The reorganized memory address 579 is then utilized as an address into the 
corresponding memory bank eg. 585 which comprizes 128 entries each of 32 bits. The 7 

20 bit line address is utilized to access the memory 585 resulting in the corresponding output 
being latched 586 for each of the memory banks. Each memory bank, eg 585 has an 
associated tag memory which comprizes 128 entries each of 2 bits. The 7 bit line 
address is also utilized to access the corresponding tag in tag memory 587. The two 
most significant bits of the address 579 are compared with the corresponding tag in tag 

25 memory 587 to determine if the relevant output color value is stored in die cache. These 
two most significant bits of the 9 bit address correspond to the most significant bits of the 
red and green data intervals (see Table 12). Thus in the MOGCS mode the RGB input 
color space is effectively divided into quadrants along the red and green dimensions 
where the two most significant bits of the 9 bit address designates the quadrant of the 

30 RGB input color space. Hence the output color values are effectively divided into four 
quadrants each designated by a two bit tag. Consequently the output color values for 
each tag value for a particular line are highly spaced apart in the output color space, 
enabling anti-aligning of memory patterns. 
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Where the two bit tags do not match a cache miss is recorded by the data cache 
. controller and the corresponding required memory read is initiated by the data cache 
controller with the cache look up process being stalled until all values for that line 
corresponding to that two bit tag entry are read from an external memory and stored in the 
5 cache. This involves the reading of the relevant line of the color conversion table stored 
in the external memory. The process 575 of Fig. 63 is carried out for each of the 
memory banks eg. 573 of Fig. 62 resulting, depending on the cache contents, in a time 
interval elapsing before the results eg. 586 are output from each corresponding memory 
bank. Each of the eight 32 bit sets of data 586 are then forwarded to the main data path 
10 unit (242) which carries out the aforementioned interpolation process (Fig. 62) in three 
stages 590 - 592 to each of the colored channels simultaneously and in a pipelined manner 
so as to produce four color outputs 595 for sending to a printer device. 

Experiments have shown that the caching mechanism as described with reference 
to Figs. 62 and 63 can be advantageously utilized as typical images have a cache miss-rate 
]5 on average requiring between 0.01 and 0.03 cache line fetches per pixel. The utilization 
of the caching mechanism therefore leads to substantially reduced requirements, in the 
typical case, for memory accesses outside of the data cache. 

The instruction encoding for both color space conversion modes (Fig. 10) 
utilized by the co-processor has the following structure: 

20 



Table 12A: Instruction Encoding for Color Space Conversion 



Operand 


Description 


Internal Format 


External Format 


Operand A 


source pixels 


pixels 


packed stream 


Operand B 


mulri output channel 
color conversion tables 


other 


multi channel esc 
tables 


Operand C 


Interval and Fraction 
Tables 




I&F table format 


Result 


pixels 


pixels 


packed stream 




bytes 


unpacked bytes 


unpacked bytes, 
packed stream 
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The instruction field encoding for color space conversion instruction is illustrated 

in Fig. 64 with the following nvior opcode encoding for the color conversion 
instructions. 



Table 13 > Minor Opcode Encoding for Color Conversion Instructors 



Field 

trans[3:0] 


Description 

0 = do not apply translation and clamping step to corresponding 
output value on this channel 


M 


0 - single channel color table format 

1 = multi channel color table format 



^ Fig. 65 shows a method of converting a stream of RGB pixels into CYMK color 

values according to the MOGCS mode. In step Sj, a stream of 24 bit ROB pixels are 
received by the pixel organiser 246 (Fig. 2). In step the pixel organiser 246 

10 determines the 4 bit interval values and the 8 bit fractional values of each input pixel from 
lookup tables, in the manner previously discussed with respect to Fig?. 56 and 57. The 
interval and fractional values of the input pixel designate which intervals and fractions 
along the intervals in which the input pixel lies. In step S 3 , the main data path unit 242 
concatenates the 4 bit intervals of the red, green and blue color components of the input 

is pixel to form a 12 bit address word and supplies this 12 bit address word to the data cache 
controller 240 (Fig. 2). In step S 4 , the data cache controller 240 converts this 12 bit 
address word into 8 different 9 bit addresses, in the manner previously discussed with 
respect to Table 12 and Fig. 62. These 8 different addresses designate the location of the 
8 output color values CV(P0>CV(P7) in the respective memory banks 573 (Fig. 62) of 

20 the data cache 230 (Fig. 2). In step S 3 , the data cache controller 240 (Fig. 2) remaps the 
8 different 9 bit addresses in the manner described previously with respect to Fig. 63. Tn 
this way, the most significant bit of the red and green 4 bit intervals are mapped to the two 
most significant bits of the 9 bit addresses. 

In step Sg, the data cache controller 240 then compares the two most significant 



25 bits of the 9 bit addresses with respective 2 bit tags in memory 587 (Fig. 63> If the 2 bit 
tag does not correspond to the two most significant bits of the 9 bit addresses, then the 
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output color values CV(P(J)-CV(P7) do not exist in the cache memory 230. Hence, in 
step S 7 , all the output color values corresponding to the 2 bit tag entry for that line arc 
read from external memory into the data cache 230. If the 2 bit tag corresponds to these 
two most significant bits of the 9 bit addresses, then the data cache controller 240 
5 retrieves in step S 8 the eight output color values CV(P0)-CV(P7) in the manner discussed 
previously with respect to Fig. 62. In this way, the eight output color values CV(P0)- 
CV(P7) surrounding the input pixel are retrieved by the main data path unit 242 from the 
data cache 230. In step S 7 , the main data path unit 242 interpolates the output color 
values CV(P0)-CV(P7) utilising the fractional values determined in step S 2 and outputs 

10 the interpolated output color values. 

It will be evident to the man skilled in the art, that the storage space of the data 
cache storage may be reduced further by dividing the RGB color space and the 
corresponding output color values into more than four quadrants, for example 32 blocks. 
In the latter case, the data cache can have the capacity of storing only a 1/32 block of 

15 output color values. 

Tt will also be evident to the man skilled in the art, that the data caching 
arrangement utilized in the MOGCS mode can also be used in a single output general 
conversion mode. Hence, in the latter mode the storage space of the data cache can also 
be reduced. 

20 3.113 JPEG Coding/Decoding 

It is well known that a large number of advantages can be obtained from storing 
images in a compressed format especially m relation to the saving of memory and the 
speed of transferring images from one place to another. Various popular standards have 
arizen tor image compression. One very popular standard is the JPEG standard and for a 

25 full discussion of the implementation of this standard reference is made to the well known 
text JPEG: Still Image Data Compression Standard by Pennebaker and Mitchell 
published 1993 by Van Nostrand Rcinhold. The co-processor 224 utilizes a subset of 
the JPEG standard in the storage of images. The JPEG standard has the advantage that 
large factor compression can be gained with the retention of substantial image quality. 

30 Of course, other standards for storing compressed images could be utilized. The JPEG 
standard is well-known to those skilled in the art, and the various JPEG alternative 
implementations readily available in the marketplace from manufaaurers including JPEG 
core products for irKxirporation into ASICS. 
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The co-processor 224 implements JPEG compression and decompression of 
images consisting of 1, 3 or 4 color components. One-color-component images may be 
meshed or unmeshed. That is, a single-color-coraponent can be extracted from meshed 
data or extracted from unmeshed data. An example of meshed data is three-color 
5 compon:; ,-s. per pixel riaturn (i.e., RGB *ei datum), -s. * an example of unmesh- ' 

data is where each color component for an image is stored separately such that each color 
component can be processed separately. For three color component images the co- 
processor 224 utilizes one pixel per word, assuming the three color channels to be 
encoded in the lowest three bytes. 

10 The JPEG standard decomposes an image into small two dimensional units 

called minimum coded units (MCU). Each minimal coded unit is processed separately. 
The JPEG coder 241 (Fig. 2) is able to deal with MClTs which are 16 pixels wide and 8 
pixels high for down sampled images or MCU's which are 8 pixels wide and 8 pixels high 
lor images that are not to be down sampled. 

15 Turning now to Fig. 66, there is illustrated the method utilized for down 

sampling three component images* 

The original pixel data 600 is stored in the MUV buffer 250 (Fig. 2) in a pixel 
form wherein each pixel 601 comprizes Y, U and V components of the YUV color space. 
This data is first converted into a MCU unit which comprizes four data blocks 601 - 604. 

20 The data blocks comprize the various color components, with the Y component being 
directly sampled 601, 602 and the U and V components being sub-sampled in the 
particular example of Fig, 13 to form blocks 603, 604. Two forms of sub-sampling axe 
implemented by the co-processor 224, including direct sampling where no filtering is 
applied and odd pixel data is retained while even pixel data is discarded. Alternatively, 

23 filtering of the U and V components can occur with averaging of adjacent values taking 
place. 

An alternative form of JPEG sub-sampling is four color channel sub-sampling as 
illustrated in Fig. 67. In this form of sub-sampling, pixel data blocks of 16 x 8 pixels 
610 each have four components 611 including an opacity component (O) in addition to 
30 the usual Y, U, V components. This pixel data 410 is sub-sampled in a similar manner 
to that depicted in Fig. 66 

However, in this case, the opacity channel is utilized to form data blocks 612, 

613. 
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Turning now to Fig. 68, there is illustrated the JPEG coder 241 of Fig. 2 in more 
detail. The JPEG encoder/decoder 241 is utilized for both JPEG encoding and decoding. 
The encoding process receives block data via bus 620 from the pixel organizer 246 (Fig. 
2). The block data is stored within the MUV buffer 250 which is utilized as a block 
5 staging area. The JPEG encoding process is broken down into a number of well defined 
stages. These stages include: 

1. taking a discrete cosine transform (DCT) via DCT unit 621; 

2. quantising the DCT output 622; 

3. placing the quantized DCT co-efficients in a zig zag order, also carried 
10 out by quantizer unit 622; 

4. predictively encoding the DC DCT co-efficients and run length 
encoding the AC DCT co-efficients carried out by co-efficient coder 623; and 

5. variable length encoding the output of the co-efficients coder stage, 
carried out by Huffman coder unit 624. The output is fed via multiplexer 625 and Rhus 

15 626 to (he result organizer 629 (Fig. 2). 

The JPEG decoding process is the inverse of JPEG encoding with the order of 
operations reversed. Hence, the JPEG decoding process comprizes the steps of inputting 
on Bus 620 a JPEG block of compressed data. The compressed data is transferred via Bus 
630 to the Huffman coder unit 624 which Huffinan decodes data into DC differences and 

20 AC run lengths. Next, the data is forwarded to the co-efflclents coder 623 which 
decodes the AC and DC co-efficients and puts them into their natural order. Next, the 
quantizer unit 622 dequantizes the DC co-efficients by multiplying them by a 
corresponding quantization value. Finally, the DCT unit 621 applies an inverse discrete 
cosine transform to restore the original data which is then transferred via Bus 631 to the 

25 multiplexer 625 for output via Bus 626 to the Result Organizer. The JPEG coder 241 
operates in the usual manner via standard CBus interface 632 which contains the registers 
set by the instructions controller in order to begin operation of the JPEG coder. Further, 
both the quantizer unit 622 and the Huffman coder 624 require certain tables which arc 
loaded in the data cache 230 as required. The table data is accessed via an OBus 

30 interface unit 634 which connects to the operand organizer B unit 247 (Fig. 2) which in 
turn interacts with the data cache controller 240. 



The DCT unit 621 implements forward and inverse discrete cosine transforms on 
pixel data. Although many different types of DCT transforming implementations are 
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known and discussed in the Still Image Data Compression Standard (ibid), the DCT 621 

implenutrts a high speed form of transform more fully discussed in the section h~ dn 
entitled A Fast DCT Apparatus, which may Implement a DCT transform operation in 
accordance with the article entitled A Fast DCT-SQ Scheme for Images by Arai el. al., 

5 pubii.. : in 7" .. as of lr ICE, Vol E71, No. 1L, November 1983 at page 

1095. 

'["he qu- -MrT 622 implement? wte'dtwTi and dequandz.^ion OCT 
components and operates via fetching relevant values from corresponding tables stored in 
the data cache via the OBus interface unit 634. During quantization, the incoming data 

10 stream is divided by values read from quantization tables stored in the data cache. The 
division is implemented as a fixed point multiply. During dequantization, the data 
stream is multiplied by values kept in the dequantization table. 

Turning to Fig. 69, there is illustrated the dequantizer 622 in more detail. The 
quantizer 622 includes a DCT interface 640 responsible for passing data to and receiving 

15 data from the DCT module 621 via a local Bus. During quantization, the quantizer 622 
receives two DCT co-efficients per clock cycle. These values are written to one of the 
quantizers internal buffers 641, 642. The buffers 641, 642 are dual ported buffers used 
to buffer incoming data. During quantization, co-efficient data from the DCT sub- 
module 621 is placed into one of the buffers 641, 642. Once the buffer is full, the data is 

20 read from the buffer in a zig zag order and multiplied by multiplier 643 with the 
quantization values received via OBus interface unit 634. The output is forwarded to the 
co-efficient coder 623 (Fig. 68) via co-efficient coder interface 645. While this is 
happening, the next block of co-efficients is being written to the other buffer. During 
JPEG decompression, the quantizer module dequantizes decoded DCT co-efficients by 

25 multiplying them by values stored in the table. As the quantization and dequantization 
operations ore mutually exclusive, the multiplier 643 is utilized during quantization and 
dequantization. The position of the co-efficient within the block of 8x8 values is used as 
the index into the dequantization table. 

As with quantization, the two buffers 641, 642 are utilized to buffer mcoming 

30 co-efficient data from the oo-efficient coda" 623 (Fig. 68). The data is multiplied with 
its quantization value and written into the buffers in reverse zig zag order. Once full, the 
dequantized co-efficients are read out of the utilized buffer in natural order, two at a time, 
and passed via DCT interface 640 to the DCT sub-module 621 (Fig. 68). Hence the co- 
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efficients coder interface module 645 is responsible for interfacing to the co-efficients 
coder and passes data and receives data from the coder via a local Bus. This module 
also reads data from buffers in zig zag order during compression and writes data to the 
buffers in reverse zig zag order during decompression. Both the DCT interface module 

5 640 and the CC interface module 645 are able to read and write from buffers 641, 642. 
Hence, address and control multiplexer 647 is provided to select which buffer each of 
these interfaces is interacting with under the control of a control module 648, which 
comprizes a state machine for controlling all the various modules in the quantizer. The 
multiplier 643 can be a 16x8, 2's complement multiplier which multiplies DCT co- 

10 efficients by quantization table values. 

Turning again to Fig. 68, the co-efficient coder 623 performs the functions of: 

(a) predictive encoding/decoding of DC co-efficients in JPEG mode; and 

(b) run length encoding/decoding of AC co-efficients in JPEG mode. 
Preferably, the co-efficient coder 623 is also able to be utilized for predictive 

15 encoding/decoding of pixels and memory copy operations as required independently of 
JPEG mode operation. The co-efficient coder 623 implements predictive and run length 
encoding and decoding of DC and AC co-efficients as specified in the Pink Book. A 
standard implementation of predictive encoding and predictive decoding in addition to 
JPEG AC co-efficients run lengthing encoding and decoding as specified in the JPEG 

20 standard is implemented. 

The Huffinan coder 624 is responsible for Huffman encoding and decoding of 
the JPEG data train. In Huffman encoding mode, the run length encoded data is received 
from the co-efficients coder 623 and utilized to produce a Huffman stream of packed 
bytes. Alternatively, or in addition, in Huffinan decoding, the Huffinan stream is read 

25 from the PBus interface 620 in the form of packed bytes and the Huffinan decoded co- 
efficients are presented to the co-efficient coder module 623. The Huffinan coder 624 
utilizes Huffinan tables stored in the data cache and accessed via OBus interface 634. 
Alternatively, the Huffinan table can be hardwired for maximum speed. 

When utilising the data cache for Huffinan coding, the eight banks of the data 

30 cache store data tables as follows with the various tables being described in further detail 
hereinafter. 
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Table 14: Huffman and Quantization Tables as stored in Data Cache 



Bank 


Description 


0 


This bank hold the 256, 16 bit entries of a EHUFCOJDCM or 
EHUFCO table. The least significant bit of the index chooses 

httivv&en two 16 bit itsois in the 32 bit word. Ail 12S lines of 
this bank of memory are used. 


I 1 


This ,yai± iidds the 256, 16 bit entries of a EMUFCO_DC"_ : 
tabic. The least significant bit of the index chooses between the 
two 16 bit items in the 32 bit word. All 128 lines of this bank of 
memory are used. 


2 


This bank holds the 256, 16 bit entries of a EHUFCO_AC J. 
table, The least significant bit of the index chooses between the 
two 16 bit items in the 32 bit word. Al 128 lines of this bank of 
memory are used. 


3 


This bank holds the 256, 16 bit entries of a EHUFCO_AC_2 
table. The least significant bit of the index chooses between the 
two 16 bit items in the 32 bit word. All 128 lines of this bank of 
memory are used. 


4 


This bank holds the 256, 4 bit entires of a EHUFSI_DCJ or 
EHUFSI table, as well as the 256, 4 bit entires of a 
EHUFSIJDC_2 table. All 128 lines of this bank of memory arc 
used. 


5 


TWs bank holds the 256, 4 bit entries of a EHUFSI_AC_1 table, 
as well as the 256, 4 bit entries of a EHUFSI_AC_2 table. All 
128 lines of this bank of memory are used. 


6 


Not used 


7 


Ibis banks holds the 128, 24 bit entries of the quantization table. 
It occupies the least significant 3 bytes of all 128 lines of this 
bank of memory. 



Turning now to Fig. 70, the Huffman coder 624 consists primarily of two 
independent blocks being an encoder 660 and a decoder 661. Both blocks 660,661 share 
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the same OBus interface via a multiplexer module 662. Each block has its own input 
and output with only oris block active ai a time, depending on the function performed by 
the JPEG encoder. 

a. Encoding 

5 During encoding in JPEG mode, Huffman tables are used to assign codes of 

varying lengths (up to 16 bits per code) to the DC difference values and to the AC run- 
length values, which arc passed to the HC submodule from the CC submodule. These 
tables have to be preloaded into the data cache before the start of the operation. The 
variable length cade words are then concatenated with the additional bits for DC and AC 

10 co-efficients (also passed from the CC submodule, then packed into bytes. A X ! 00 byte is 
stuffed in if an X'FF byte is obtained as a result of packing. If there is a need for an 
RST m marker it is inserted. This may require byte padding with "1" bits of the last 
Huffman code and X'OO byte shifting if the padded byte results in X'FF. The need for an 
RST m marker is signalled by the CC submodule. The HC submodule inserts the EOI 

15 marker at the end of image, signalled by the "final ■ signal on the PBus-CC slave interface. 
The insertion procedure of the EOI marker requires similar packing, padding and 
stuffing operations as for RSTm markers. The output stream is finally passed as packed 
bytes to the Result Organizer 249 for writing to external memory. 

In non-JPEG mode data is passed to the encoder from the CC submodule (PBus- 

20 CC slave interface) as unpacked bytes. Each byte is separately encoded using tables 
preloaded into the cache (similarly to JPEG mode), the variable length symbols are then 
assembled back into packed bytes and passed to the Results Organizer 249. The very 
last byte in the output stream is padded with l's. 

b. Decoding 

25 Two decoding algorithms are implemented: fast (real time) and slow (versatile). 

The fast algorithm works only in JPEG mode, the versatile one works both in JPEG and 
non-JPEG modes. 

The fast JPEG Huffman decoding algorithm maps Huffman symbols to either 
DC difference values or AC run-length values. It is specifically tuned for JPEG and 
30 assumes that the example Huffman tables (K3, K4, K5 and K6) were used during 
compression. The same tables are hard wired in to the algorithm allowing 
decompression without references to the cache memory. This decoding style is intended 
to be used when decompressing images to be printed where certain data fates need to be 
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guaranteed. The data rate for the HC submodule decompressing a band (a block 
between RST ra markers) is almo?? one DC/AC co-efficient per clock cycle. One clock 
cycle delay between the HC submodule and CC sub-module may happen for each X'OO 
stuff byte being removed from the data stream, however this is strongly data dependent. 
:: The Hulirriaii feeder operas In a fa,...^ ::\odc for the ex.raction c>v* 

Huffinan symbol per clock cycle. The fast Huffman decoder is described in the section 
herein entitled Decoder of Variable Length Codi ,>. 

Additionally, the Huffinan decoder 661 also implements a heap-based slow 
decoding algorithm and has a structure 670 as illustrated in Fig. 71 . 
10 For a JPEG encoded stream, the STRIPPER 671 removes the X r 00 stuff bytes, 

the X'FF fill bytes and RST m markers, passing Huffman symbols with concatenated 
additional bits to the SHIFTER 672. This stage is bypassed for Huffman-only coded 
streams. 

The first step in decoding a Huffinan symbol is to look up the 256 entries 
15 HUFVAL table stored in the cache addressing it with the first 8 bits of the Huffinan data 
stream. If this yields a value (and the true length of the corresponding Huffman symbol), 
the value is passed on to the OUTPUT FORMATTER 676, and the length of the symbol 
and the number of the additional bits for the decoded value are fed back to the SHIFTER 
672 enabling it to pass the relevant additional bits to the OUTPUT FORMATTER 676 
20 and align the new front of the Huffinan stream presented to the decoding unit 673. The 
number of the additional bits is a function of the decoded value. If the first look up does 
not result in a decoded value, which means that the Huffman symbol is longer than 8 bits, 
the heap address is calculated and successive heap (located in the cache, too) accesses are 
performed following the algorithm until a match is found or an "illegal Huffman symbol" 
25 condition met. A match results in identical behavior as in case of the first match and 
"illegal Huffman symbol 1 ' generates an interrupt condition. 

The algorithm for heap-based decoding algorithm is as follows: 
loop until end of image 

set symbol length N to 8 
30 get first 8 bits of the input stream Into INDEX 



fetch HUFVAL(INDEX) 

if HUFVAL(INDEX) == OOxx 0000 111 - (ELL) 
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signal "illegal Huffinan symbol" 

exit 

elsif HUFVAL(INDEX) == Innn eeee eeee -- (HIT) 
pass nnn bits to cece eeee as the yalue 
5 pass symbol Length N = decimal (nim)/*000 

as symbol length 8*/ 
adjust the input stream 
break 

else/* HUFVAL (INDEX) == Oliii iiii iiii - (MISS)*/ 
10 set HEAPINDEX = ii iiii iiii - (we assume heapbase = 0) 

setN =9 

if 9th bit of the input stream == 0 
increment HEAPENDEX 

.fi 

t 5 fetch VALUE = HEAP (HEAPINDEX) - (code for 9th bit) 

loop 

if VALUE == 0001 0000 1111 -- (ILL) 
signal "illegal Huffinan symbol" 
exit 

20 elsif VALUE ~ 1000 eeee eeee 

pass eeee eeee as the value 
pass symbol length N 
adjust the input stream 
break 

25 else/* VALUE == Oliii iiii iiii - (MISS) */ 

set N = N + 1 (HEAPINDEX = ii iiii iiii) 
if Nth bit of the input stream = 0 
increment HEAPINDEX 

fi 

30 fetch VALUE = HEAP (HEAPINDEX) 

pool 

pool 
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The STRIPPER 671 removes any X'QO stuff bytes, XTF Sll bytes and RST m 

mark^s from the mco™ hg JPEG 671 coded strcar: and jr-sses "clean" Huffman symbol , 
with concatenated additional bits to the shifter 672. There arc no additional bits in 
Huffman-only encoding, so in this mode the passed stream consists of Huffman symbols 

:.- onlj. 

The shifter 672 block has a 16 bit output register in which it presents the next 
Huffman symbol to the decoding unit 673 (bitstrearn running from MSB to LSB). Often 
the symbol is shorter than 16 bits, but it is up to the decoding unit 673 to decide how 
many bits are currently being analysed. The shifter 672 receives a feedback 678 from 

10 the decoding unit 673, namely the length of the current symbol and the length of the 
following additional bits for the current symbol (in JPEG mode), which allows for a shift 
and proper alignment of the beginning of the next symbol in the shifter 672. 

The decoding unit 673 implements the core of the heap based algorithm and 
interfaces to the data cache via the OBus 674. It incorporates a Data Cache fetch block, 

15 lookup value comparator, symbol length counter, heap index adder and a decoder of the 
number of the additional bits (the decoding is based on the decoded value). The fetch 
address is interpreted as follows: 



Table 15: Fetch Address 



| Field (bits) 


Description 


[32:25] 


Index into dequantization tables. 


[24:19] 


Not used. 


[18:9] 


Index into the heap. 


[8:0] ; 


Index into Huffman decode table. 



The OUTPUT FORMATTER block 676 packs decoded 8-bit values (standalone 
Huffman mode), or pocks 24-bit value + additional bits + RST m marker information 
(JPEG mode) into 32-bit words. The additional bits are passed to the OUTPUT 
FORMATTER 676 by the shifter 672 after the decoding unit 673 decides on the start 
25 position of the additional bits for the current symbol. The OUTPUT FORMATTER 673 
also implements a 2 deep FIFO buffer using a one word delay for prediction of the final 
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or slow) tries to decode the trailing padding bits at the end of the input bitstream. This 
situation is normally detected by the shifter and instead of asserting the "illegal symbol" 
interrupt, it asserts a "force Gnal" signal. Active "force final" signal forces the OUTPUT 
FORMATTER 676 to signal the last but one decoded word as "final" (this word is still 
5 present in the FIFO) and discard the very last word which does not belong to the decoded 
stream. 

The Huffman encoder 660 of Fig, 70 is illustrated in Fig. 72 in more detail. The 
Huffman encoder 660 maps byte data into Huffman symbols via look up tables and 
includes a encoding unit 681, a shifter 682 and a OUTPUT FORMATTER 683 with the 

10 lookup tables being accessed from the cache. 

Each submitted value 685 is coded by the encoding unit 681 using coding tables 
stored in the data cache. One access to the cache 230 is needed to encode a symbol, 
although each value being encoded requires two tables, one that contains the 
corresponding code and the other that contains the code length. During JPEG 

15 compression, a separate set of tables is needed for AC and DC co-efficients. If 
subsampling is performed, separate tables are required for subsampled and non 
subsampled components. For non-JPEG compression, only two tables (code and size) 
are needed. The code is then handled by the shifter 682 which assembles the outgoing 
stream on bit level The Shifter 682 also performs RST m and EOI markers insertion 

20 which implies byte padding, if necessary. Bytes of data are then passed to the OUTPUT 
FORMATTER 683 which does stuffing (with XXX) bytes), filling with XTF bytes, also 
the FF bytes leading the marker codes and formatting to packed bytes. In the non-JPEG 
mode, only formatting of packed bytes is required. 

Insertion of XTF bytes is handled by the shifter 682, which means that the output 

25 formatter 683 needs to tell which bytes passed from the shifter 682 represent markers, in 
order to insert an XTF byte before* Hiis is done by having a register of tags which 
correspond to bytes in the shifter 682. Each marker, which must be on byte boundaries 
anyway, is tagged by the shifter 682 during marker insertion. The packer 683 does not 
insert stuff bytes after the X H FF M bytes preceding the markers. The tags are shifted 



30 synchronously with the main shift register. 

The Huffman encoder uses four or eight tables during JPEG compression, and 
two tables for straight Huffman Encoding. The tables utilized are as follows: 
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Table 16: Tables Used by the Huffman Encoder 



,,, ...J...... . ... ,,rmm 

Name 


Size 


Description 


EHUFSI 


256 


Huffman code sizes. Used during straight 

Huffman encoding. Uses the coded value as 
an index. 


EHUFCO 


256 


Huffman code values used during srraigh* 
Huffman encoding. Uses the coded value as 
an index. 


EHUFSIJDC.1 


16 


Huffman codes sizes used to code DC co- 
efficients during JPEG compression. Uses 
magnitude category as the index. 


EHUFCO_DC_l 


16 


Huffman code values used to code DC co- 
efficients during JPEG compression. Uses 
magnitude category as an index. Used for 
subsampied blocks. 


EHUFSI_DC_2 


16 


Huffman code sizes used to code DC co- 
efficients during JPEG compression. Uses 
magnitude category as an index. Used for 
subsampied blocks. 


EHUFCO„DC„2 


16 


Huffman code sizes used to code DC co- 
efficients during JPEG compression. Uses 
magnitude category as an index* Used for 
subsampied blocks. 
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Name 


Size 


Description 


EHUFS1_AC_1 


256 


Huffman code sizes used to code AC co- 
efficients during JPEG compression. Uses 
magnitude category and run-length as an 
index. 


EHUFC0_AC_1 


256 


Huffman code sizes used to code AC co- 
efficients during JPEG compression. Uses 
magnitude category and run-length as an 
index. 


EHUFSI_AC2 


256 


Huffman code sizes used to code AC co- 
efficients during JPEG compression for 
subsampled components. Uses magnitude 
category and run-length as an index. 


EHUFC0_AC2 


256 


Huffman code sizes used to code AC co- 
efficients during JPEG compression for 
subsampled components. Uses magnitude 
category and run-length as an index. 



3.17.4 Table Indexing 

Huffman tables arc stored locally by the co-processor data cache 230* The data 
cache 230 is organized as a 128 line, direct mapped cache, where each line comprizes 8 
words. Each of the words in a cache line axe separately addressable, and the Huffman 
decoder uses this feature to simultaneously access multiple tables. Because the tables 
are small (<= 256 entries), the 32 bit address field of the OBus can carry indexes into 
multiple tables. 

As noted previously, in JPEG slow decoding mode, the data cache is utilized for 
storing various Huffman tables. The format of the data cache is as follows: 



Table 17: Bank Address for Huffman and Quantization Tables 



Bank 



0to3 



Description 



These banks bold the 1024, 16 bit entries of the heap. The 
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1 7: :z z i " . ' J J L ■ ■ . ■ - - ■ — . ■ — ... * h i... ■ . an^^s — ■ ... _.s= 

least significant index bit selects between the two 16 bit words 

I ' in sr-h bank. AJ! ?.?8 li< ,s of the four banks of memory ^ 

used. 
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I Bank 


Description 




This bank holds the 512, least significant 8 bits of the 12 bit 
entries of the DC Huffman decode table. The least significant 
two bils of the index chooses between the four, byte items in 
the 32 bit word. All 128 line of this bank of memory are 
used. 


5 


This bank holds the 512, least significant 8 bits of the 12 bit 
entires of the AC Huffman decode table. The least significant 
two bits of the index chooses between the four, byte items hi 
the 32 bit word. All 128 lines of this bank of memory are 
used. 


6 


This bank holds the most significant 4 bits of both the DC and 
AC Huffman decode tables. The least significant 2 bits of 
each index chooses between the 4 respective nibbles within 
each word. 


7 


This bank holds the 128, 24 bit entires of the quantization 
table. It occupies the least significant 3 bytes of all 128 lines 
of this bank of memory. 



Prior to each JPEG instruction being executed by the JPEG coder 241 (Fig. 2) the 
appropriate image width value in the image dimensions register (POJDR) or (RO JDR) 
5 must be set. As with other instructions, the length of the instruction refers to the number 
of input data items to be processed. This includes any padding data and accounts for any 
suh-sampling options utilized and for the number of color channels used. 

All instructions issued by the co-processor 224 may utilize two facilities for 
limiting the amount of output data produced. These facilities are most useful for 
10 instructions where the input and output data sizes are not the same and in particular where 
the output data size is unknown, such as for JPEG coding and decoding. The facilities 
determine whether the output data is written out or merely discarded with everything else 
being as if the instruction was properly processed. By default, these facilities arc 
normally disabled and can be enabled by enabling the appropriate bits in the RO_CFG 



15 register. JPEG instructions however, include specific option for setting these bits. 
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Preferably, when utilising JPEG compression, the co-processor 224 provides facilities for 
"cutting" and "liin'tin-" of output data. 

Turning to Fig. 73, there is now described the process of cutting and limiting. 
An input image 690 may be of a certain height 691 and a certain width 692. Often, only 
•c - -ortK' - f the irvK* is of interest with other portions being irrelevant for the pvzpooes 
of printing out. However, the JPEG encoding system deals with 8x8 blocks of pixels. 
V may be the case that, firstly, the image width is not an cxr.:\ Triple of 8 and 
additionally, the section of interest comprising MCU 695 does not fit across exact 
boundaries. An output cut register, RO_cut specifies the number of output bytes at 696 
at the beginning of the output data stream to discard. Further, an output limit register, 
RO_LMT specifies the maximum number of output bytes to be produced. This count 
includes any bytes that do not get written to memory as a result of the cut register. 
Hence, it is possible to target a final output byte 698 beyond which no data is to be 
outputted. 

There are two particular cases where the cut and limited functionality of the 
JPEG decoder is considered to be extremely useful. The first case, as illustrated in Fig. 
74, is the extraction or decompression of a sub-section 700 of one strip 701 of a 
decompressed image. The second useful case is illustrated in Fig. 75 wherein the 
extraction or decompression of a number of complete strips (eg. 711, 712 and 713) fc 
required from an overall image 714. 

The instruction format and field encoding for JPEG instructions is as illustrated 
in Fig. 76. The minor opcode fields are interpreted as follows: 



Table 18: Instruction Word - Minor Opcode Fields 



Field 


Description 


D 




0 = encode(com press) 






1 = decode(decompress) 


M 




0 = single color channel 

1 - multi channel 


4 




0 = three channel 

1 = four channel 


S 


0 = do not use a sub/up sampling regime 
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1 = use a subsampling regime 


H 


0 = use fast Huffman coding 

1 = use general purpose Huffman coding 


C 


0 = do not use cut register 

1 = use cut register 


T 


0 = do not truncate on output 

1 = truacate on output 


F 


0 = do not low pass filter before subsampling 

1 = low pass filter before subsampling 



3.175 Data Coding Instructions 

Preferably, the co-processor 224 provides for the ability to utilize portions of the 
JPEG coder 241 of Fig. 2 ia other ways. For example, Huffman coding is utilized for 
5 both JPEG and many other methods of compression. Preferably, there is provided data 
coding instructions for manipulating the Huffman coding unit only for hierarchial image 
decompression. Further, the run length coder and decoder and the predictive coder can 
also be separately utilized with similar instructions, 
3.17.6 A Fast OCT Apparatus 

10 Conventionally, a discrete cosine transform (DCT) apparatus as shown in Fig. 77 

performs a full two-dimensional (2-D) transformation of a block of 8x8 pixels by first 
performing a 1-D DCT on the rows of the 8x8 pixel block. It then performs another 1-D 
DCT on the columns of the 8x8 pixel block. Such an apparatus typically consists of an 
input circuit 1096, an arithmetic circuit 1104, a control circuit 1098, a transpose memory 

15 circuit 1090, and an output circuit 1092. 

The input circuit 1096 accepts 8-bit pixels from the 8x8 block. The input circuit 
1096 is coupled by intermediate multiplexers 1100, 1102 to the arithmetic circuit 1004. 
The arithmetic circuit 1104 performs mathematical operations on either a complete row or 
column of the 8x8 block. The control circuit 1098 controls all the other circuits, and 

20 thus implements the DCT algorithm. The output of the arithmetic circuit is coupled to 
the transpose memory 1090, register 1095 and output circuit 1092. The transpose 
memory is in turn connected to multiplexer 1100, which provides output to the next 
multiplexer 1102. The multiplexer 1102 also receives input from the register 1094. 
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The transpose circuit 1090 accepts 8x8 block data in rows and produces that data in 

columns. The output circuit 1092 provides the coefficients of the DCT performed v. : 
8x8 block of pixel data. 

In a typical DCT apparatus, it is the speed of the arithmetic circuit 1104 that 

5 t „-iJ3y d,:vrnV :s the c r all ir-s vd of the ?f-p -tiw. sirxie the arithmetic rrcuit ll'V : s 
the most complex. 

The arithmetic cYe^t 1104 of Fig. 77 is typical? y implemented by b real* ins the 
arithmetic process down into several stages as described hereinafter with reference to Fig, 
78. A single circuit is then built that implements each of these stages 1114, 1148, 1152, 

10 1156 using a pool of common resources, such as adders and multipliers. Such a circuit 
1104 is mainly disadvantageous due to it being slower than optimal, because a single, 
common circuit is used to implement the various stages of circuit 1104. This includes a 
storage means used to store intermediate results. Since the time allocated for the dock 
cycle of such a circuit must be greater or equal to the time of the slowest stage of the 

15 circuit, the overall time is potentially longer than the sum of all the stages. 

Fig. 78 depicts a typical arithmetic data path, in accordance with the apparatus of 
Fig. 77, as part of a DCT with four stages. The drawing does not reflect the actual 
implementation, but instead reflects the functionality. Each of the four stages 1144, 
1148, 1152, and 1156 is implemented using a single, reconfigurable circuit. It is 

20 reconfigured on a cyde-by-cycle basis to implement each of the four arithmetic stages 
1144» 1148, 1152, and 1156 of the 1-D DCT. In this circuit, each of the four stages 1144, 
1148, 1152, and 1156 uses pool of common resources (e.g. adders and multipliers) and 
thus minimises hardware. 

i 

However, the disadvantage of this circuit is that it is slower than optimal The four 
25 stages 1144, 1148, 1152, and 1156 are each implemented from the same pool of adders 
and multipliers. The period of the dock is therefore determined by the speed of the 
slowest stage, which in this example is 20 ns (for block 1144). Adding in the delay (2ns 
each) of the input and output multiplexers 1146 and 1154 and the delay (3ns) of the flip- 
flop 1150, the total time is 27 ns. Thus, the fastest this DCT implementation can run at 
30 is 27 ns. 



Pipelined DCT implementations are also well known. The drawback with such 
implementations is that they require Large amounts of hardware to implement Whilst 
the present invention does not offer the same performance in terms of throughput, it offers 
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an extremely good performance/size compromise, and good speed advantages over most 
of the current DCT implementations. 

Fig. 79 shows a block diagram of the preferred form of discrete cosine transform 
unit utilized in the JPEG coder 241 (Fig. 2) where pixel data is inputted to an input circuit 
5 1126 which captures an entire row of 8-bit pixel data. The transpose memory 1118 
converts row formatted data into column formatted data for the second pass of the two 
dimeasional discrete cosine transform algorithm. Data from the input circuit 1126 and 
the transpose memory 1118 is multiplexed by multiplexer 1124, with the output data from 
multiplexer 1124 presented to the arithmetic circuit 1122. Results data fium the 

10 arithmetic circuit 1 122 is presented to the output circuit 1120 after the second pass of the 
process. The control circuit 1116 controls the flow of data through the discrete cosine 
transform apparatus. 

During the first pass of the discrete cosine transform process row data from the 
image to be transformed, or transformed image coefficients to be transformed back to 

15 pixel data is presented to the input circuit 1126. During this first pass, the multiplexer 
1124 is configured by the control circuit 1116 to pass data from the input circuit 1126 to 
the arithmetic circuit 1122. 

Turning to Fig. 80, there is shown the structure of the arithmetic circuit 1122 in 
more detail. In the case of performing a forward discrete cosine transform, the results 

20 from the forward circuit 1138 which is utilized to calculate the forward discrete cosine 
transform is selected via the multiplexer 1142, which is configured in this way by the 
control circuit 1116* When an inverse discrete cosine transform is to be performed, the 
output from the inverse circuit 1140 is selected via the multiplexer 1142, as controlled by 
the control circuit 1 126. During the first pass, after each row vector has been processed 

75 by the arithmetic circuit 1122 (configured in the appropriate way by control circuit 1116), 
that vector is written into the transpose memory 1118. Once all eight row vectors in an 
8x8 block have been processed and written into the transpose memory 1118, the second 
pass of the discrete cosine transform begins. 

During the second pass of either the forward or inverse discrete cosine 

30 transforms, column ordered vectors are read from the transpose memory 1118 and 
presented to the arithmetic circuit 1122 via the multiplexer 1124. During this second 
pass, the multiplexer 1124 is configured by the control circuit to ignore data from the 
input circuit 1136 and pass column vector data from the transpose memory 1118 to the 
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arithmetic circuit 1122. The multiplexer 1142 in the arithmetic circuit 1122 is 
configured : y the control circuit 1116 to pass results data from the inver -e circuit 1140 to 
the output of the arithmetic circuit 1122. When results from the arithmetic circuit 1122 
are available, they are captured by the output circuit 1120 under direction from the control 
5 circuit 1 1 16 to be output ted sometime later. 

The arithmetic circuit 1122 is completely combinatorial, in that is there are no 
.Uorage elements ,xi the circuit storing U: : > .oned 1 ; -c results. The eofji. o; ci. 1116 
knows how long it takes for data to flow from the input circuit 1136, through the 
multiplexer 1124 and through the arithmetic circuit 1122, and so knows exactly when to 

10 capture the results vector from the outputs of the arithmetic circuit 1122 into the output 
circuit 1120. The advantage of having no intermediate stages in the arithmetic circuit 
1122 is that no time is wasted getting data in and out of intermediate storage elements, but 
also the total time taken for data to flow through the arithmetic circuit 1122 is equal to the 
sum of all the internal stages and not N times the delay of the longest stage (as with 

15 conventional discrete cosine transform implementations), where N is the number of stages 
in the arithmetic circuit. 

Referring to Fig. 81, the total time delay is simply the sum of the four stage 1158, 
1160, 1162, 1164, which is 20 ns + 10 ns + 12 ns + 15 ns = 57 ns, which is faster that the 
circuit depicted in Fig. 78* The advantage of this circuit is that it provides an 

20 opportunity to reduce the overall system's clock period. Assuming that four clock cycles 
are allocated to getting a result from the circuit depicted in Fig. 81, the fastest run time for 
the entire DCT system would be 57/4 ns (14,25 ns), which is a significant improvement 
over the circuit in Fig- 78 which only allows for a DCT clock period of substantially 27 
ns. 

25 An examplary implementation of the present DCT apparatus might, but not 

necessarily, use the DCT algorithm proposed in the paper to The Transactions of the 
IEICE, Vol. E 71. No. 11, November 1988, entitled A Fast DCTSQ Scheme for Images at 
page 1095 by Yukihiro Aral, Takeshi Agui and Masayuld Nakajima. By implementing 
this algorithm in hardware, it can then easily be placed in the current DCT apparatus in 

30 the arithmetic circuit 1122, Likewize, other DCT algorithms may be implemented in 
hardware in place of arithmetic circuit 1122. 



3.17.7 Huffman Decoder 
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The aspects of the following embodiment relate to a method and apparatus for 
variable-length codes interleaved with variable length bit fields. In particular, the 
embodiments of the invention provide efficient and fast, single stage (dock cycle) 
decoding of variable-length coded data in which byte aligned and not variable length 

5 encoded data is removed from the encoded data stream in a separate pre-processing block. 
Further, information about positions of the removed byte-aligned data is passed to the 
output of the decoder in a way which is synchronous with the data being decoded In 
addition, it provides fast detection and removal of not byte-aligned and not variable length 
encoded bit fields that are still present in the pre-processed input data. 

10 The preferred embodiment of the present invention preferably provides for a fast 

Huffman decoder capable of decoding a JPBG encoded data at a rate of one Huffman 
symbol per clock cycle between marker codes. This is accomplished by means of 
separation and removal of byte aligned and not Huffman encoded marker headers, marker 
codes and stuff bytes from the input data first in a separate pre-processing block. After 

15 the byte aligned data is removed, the input data is passed to a combinatorial data-shifting 
block, which provides continuous and contiguous filling up of the data decode register 
that consequently presents data to a decoding unit. Positions of markets removed from 
the original input data stream are passed on to a marker shifting block, which provides 
shifting of marker position bits synchronously with the input data being shifted in the data 

20 shifting block. 

The decoding unit provides combinatorial decoding of the encoded bit field 
presented to its input by the data decode register. The bit field is of a fixed length of n 
bits. The output of the decoding unit provides the decoded value (v) and the actual 
length (m) of the input code, where m is less than or equal to n* It also provides the 

25 length (a) of a variable length bit field, where (a) is greater than or equal to 0. The 
variable-length bit field is not Huffman encoded and follows immediately the Huf&nan 
code. The n-long bit field presented to the input of the decoding unit may be longer than 
or equal to the actual code. The decoding unit determines the actual length of the code 
(m) and passes it together with the length of the additional bits (a) to a control block. 

30 The control block calculates a shift value (a+m) driving the data and marker shifting 
blocks to shift the input data for the next decoding cycle. 

The apparatus of the invention can comprise any combinatorial decoding unit, 



including ROM, RAM, FLA or anything else based as long as It provides a decoded value, 
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the actual length of the input code, and the length of the following not Huffman encoded 
bit field within a given time frame. 

In the illustrated embodiment, the decoding unit outputs predictively encoded DC 
difference values and AC run-length values as defined in JPEG standard. The not 

5 Huffman encoded bit fields, which are extracted from the input data .simultaneously with 
decoded values, represent additional bits determining the value of the DC and AC 
coefficients as defined in JPEG standard. Another kind of not Huffman encoded bit 
fields, which arc removed from the data present in the data decode register, arc padding 
bits as defined in JPEG standard that precede byte-aligned markers in the original input 

10 data stream. These bits are detected by the control block by checking the contents of a 
padding zone of the data register. The padding zone comprises up to k most significant 
bits of the data register and is indicated by the presence of a marker bit within k most 
significant bits of the marker register, position of said marker bit limiting the length of the 
padding zone. If all the bits in the padding zone arc identical (and equal to Is in case of 

15 JPEG standard), they are considered as padding bits and are removed from the data 
register accordingly without being decoded. The contents of the data and marker 
registers are then adjusted for the next decoding cycle. 

The exemplary apparatus comprises an output block that handles formatting of the 
outputted data according to the requirements of the preferred embodiment of the invention, 

20 It outputs the decoded values together with the corresponding not variable length encoded 
bit fields, such as additional bits in JPEG, and a signal indicating position of any inputted 
byte aligned and not encoded bit fields, such as markers in JPEG, with respect to the 
decoded values. 

Data being decoded by the JPEG coder 241 (Fig. 2) is JPEG compatible and 
25 comprizes variable length Huffman encoded codes interleaved with variable length not 
encoded bit fields colled "additional bits", variable length not encoded bit fields called 
"padding bits" and fixed length, byte aligned and not encoded bit fields called "markers", 
"stuff bytes'* and "fill bytes". Fig. 82 shows a representative example of input data. 

The overall structure and the data flow in the Huffman decoder of the JPEG 
30 coder 241 is presented m Fig 83 and Fig. 84, where Fig. 83 illustrates the architecture of 
the Hufi&nan decoder of the JPEG data in more detail. The stripper 1171 removes 
marker codes (code FFXX hex , XX being non zero), fill bytes (code FF hcx ) and stuff 
bytes (code 0% tx following code FF^^ that is all byte aligned components of the input 
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data, which are presented to the stripper as 32 bit words. The most significant bit of the 
first word to be processed is the head of the input bit stream. In the stripper 1171, the 
byte aligned bit fields are removed from each input data word before the actual decoding 
of Huffman codes takes place in the downstream parts of the decoder. 

The input data arrives at the stripper's 1171 input as 32-bit words, one word per 
clock cycle. Numbering of the input bytes 1211 from 0 to 3 is shown in Fig. 85. If a 
byte of a number (i) is removed because it is a fill byte, a stuff byte or belongs to a marker, 
the remaining bytes of numbers (i-1) down to 0 are shifted to the left on the output of the 
stripper 1171 and take numbers (i) down to 1. Byte 0 becoming a "don't care" byte. 
Validity of bytes outputted by the stripper 1171 is also coded by means of separate output 
tags 1212 as shown in Fig. 85. The bytes which are not removed by the stripper 1171 
are left aligned on die stripper's output. Each byte on the output has a corresponding tag 
indicating if the corresponding byte is valid (i.e, passed on by the stripper 1171), or 
invalid (i.e. removed by the stripper 1171) or valid and following a removed marker. 
The tags 1212 control loading of the data bytes into the data register 1182 through the 
data shifter and loading of marker positions into the marker register 1183 through the 
marker shifter. The same scheme applies if more than one byte is removed from the 
input word: all the remaining valid bytes are shifted to the left and the corresponding 
output tags indicate validity of the output bytes. Fig. 85 provides examples 1213 of output 
bytes and output tags for various example combinations of input bytes. 

Returning to Fig. 83, the role of the pieshifter and postshifter blocks 1172, 1173, 
1180, 1181 is to assure loading of the data into the corresponding data register 1182 and 
marker register 1183 in a contiguous way whenever there is enough room in the data 
register and the marker register. The data shifter and the marker shifter blocks, which 
consist of the respective pre- and postshifiers, are identical and identically controlled. 
The difference is that while the data shifter handles data passed by the stripper 1171, the 
marker shifter handles the tags only and its role is to pass marker positions to the output 
of the decoder in a way synchronous with the decoded Huffman values. The outputs of 
the postshifters 1180, 1181 feed direcdy to the respective registers 1182, 1183, as shown 
inFig.83. 

In the data preshifter 1172, as also shown in Fig. 86, data arriving from the 
stripper 1171 is firstly extended to 64 bits by appending 32 zeroes to the least significant 
bit 1251. Then the extended data is shifted in a 64 bit wide barrel shifter 1252 to the 
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right by a number of bits currently present in the data register 1182. This number is 
provided by the control logk 1185 which keeps track of how many valid bits are there in 
the data 1182 and marker 1183 registers. The barrel shifter 1252 then presents 64 bits to 
the multiplexer block 1253, which consists of 64 2x1 elementary multiplexers 1254. 

5 r ■,: ; T -mcntary 2x1 multiplexer 1254 takes .-as inputs on>? bit from the bam ! -hifter ' ! \. 
and one bit from the data register 1 182. It passes the data register bit to the output when 
f.hi" 'v: h still valid n the. reg ;w ?r. O:b^ ; j70% h passes the barre' shifter's 1252 bit 
to the output. The control signals to ail the elementary multiplexers 1254 are decoded 
from a control block's shift control 1 signals as shown in Fig. 86, which are also shown in 

1U Fig. 87 as preshifter control bits 0...5 of register 1223. The outputs of the elementary 
multiplexers 1254 drive a barrel shifter 1255. It shifts left by the number of bits 
provided on a 5 bit control signal shift control 2 as shown in Fig 86. These bits 
represent the number of bits consumed from the data register 1182 by the decoding of the 
current data, which can* be either the length of the currently decoded Huffman code plus 

15 the number of the following additional bits, or the number of padding bits to be removed 
if padding bits are currently being detected, or zero if the number of valid data bits in the 
data register 1182 is less then the number of bits to be removed. In this way, the data 
appearing on the output of barrel shifter 1255 contains new data to be loaded into the data 
register 1 182 after a single decoding cycle. The contents of the data register 1182 changes 

20 in such a way that the leading (most significant) bits are shifted out of the register as 
being decoded, and 0, 8, 16, 24 or 32 bits from the stripper 1171 are added to the contents 
of the data register 1182. If there are not enough bits in the data register 1182 to decode 
them, data from the stripper 1171, if available, is still loaded in the current cycle. If 
there is no data available from the stripper 1171 in the current cycle, the decoded bits 

25 from the data register 1182 ait still removed if there is a sufficient amount of them, 
otherwize the content of the data register 1182 does not change* 

The marker preshifter 1173, postshifter 1181 and the marker register 1183 are 
units identical to the data preshifter 1172, data postshifter 1180 and the data register 1182, 
respectively. The data flow inside units 1173, 1181 and 1183 and among them is also 

30 identical as the data flow among units 1172, 1180 and 1182. The same control signals 
are provided to both sets of units by the control unit 1185* The difference is only in the 
type of data on the inputs of the marker preshifter 1173 and data preshifter 1172, as well 
as in how the contents of the marker register 1183 and the data register 1182 are used. 
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As shown in Fig. 88, tags 1261 from the stripper 1171 come as eight bit words, which 
provide two bits for each corresponding byte of data going to the data register 1182. 
According to the coding scheme shown in Fig. 85, an individual two bit tag indicating 
valid and following a marker byte has 1 on the most significant position. Only this most 

5 significant position of each of the four tags delivered by the stripper 1171 simultaneously 
is driven to the input 1262 of the marker preshifter 1173. In this way, on the input to the 
marker preshifter there may be bits set to 1 indicating positions of the first encoded data 
bits following markers. At the same time, they mark the positions of the first encoded 
data bits in the data register 1182 which follow a marker. This synchronous behavior of 

10 the marker position bits in the marker register 1183 and the data bits in the data register 
1182 is used in the control block 1185 for detection and removal of padding bits, as well 
as for passing marker positions to the output of the decoder in a way synchronous with the 
decoded data. As mentioned, the two preshiflers (data 1172 and marker 1173), 
postshifters (data 1180 and marker 1181) and registers (data 1182 and marker 1183) get 

15 the same control signals which facilitates fully parallel and synchronous operation. 

The decoding unit 1184, also shown in Fig. 89 gets the sixteen most significant 
bits of the data register 1182 which are driven to a combinatorial decoding unit 1184 for 
extraction of a decoded Huffman value, the length of the present input code being 
decoded and the length of the additional bits following immediately the input code (which 

20 is a function of the decoded value). The length of the additional bits is known after the 
corresponding preceding Huffman symbol is decoded, so is the starting position of the 
next Huffman symbol. This effectively requires, if speed of one value decoded per clock 
cycle is to be maintained, that decoding of a Huffman value is done in a combinatorial 
block. Preferably, the decoding unit comprizes four PLA style decoding tables hardwired 

25 as a combinatorial block taking a 16-bit token on input from the data register 1182 and 
producing a Huffman value (8 bits), the length of the corresponding Huffman-encoded 
symbol (4 bits) and the length of the additional bits (4 bits) as illustrated in Fig. 89. 

Removal of padding bits takes place during the actual decoding when a sequence 
of padding bits is detected in the data register 1182 by a decoder of padding bits which is 

30 part of the control unit 1185. The decoder of padding bits operates as shown in Fig. 90. 
Eight most significant bits of the marker register 1183, 1242 are monitored for presence 
of a marker position bit If a marker position bit is detected, all the bits in the data 
register 1182, 1241 which correspond to, that is have the same positions as, the bits 
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preceding the marker bit in the marker register 1242 are recognized as belonging to a 
curren* oadding zone. The content of the current padding zone is checked by the 
detector of padding bits 1243 for Vs. If all the bits in the current padding zone are l's, 
they are recognized as padding bits and are removed from the data register. Removal is 

.5 d<. :\c by rnesns of shifting of tho con:, 'vs of ctata regis-er 1382, l?Al (anr th 

time the marker register 1183, 1242) to the left using the respective shifters 1172, 1173, 
1180. 1181 in one clock cycle, as in norma! decode mode with the difference that no 
decoded value is outputted. If not ail the bits, in the current padding zone are l's, a 
normaL decode cycle is performed rather than a padding bits removal cycle. Detection of 

10 padding bits takes place each cycle as described, in case there are some padding bits in the 
data register 1 182 to be removed. 

The control unit 1185 is shown in detail in Fig. 87. The central part of the 
control unit is the register 1223 holding the current number of valid bits in the data 
register 1182. The number of valid bits in the marker register 1183 is always equal to 

15 the number of valid bits in the data register 1182. The control unit preforms three 
functions. Firstly, it calculates a new number of bits in the data register 1182 to be stored 
in the register 1223. Secondly, it determines control signals for the shifters 1172, 1173, 
1180, 1181, 1186, 1187 decoding unit 1184, and the output formatter 1188. Finally, it 
detects padding bits in the data register 1182, as described above. 

20 The new number of bits in the data register 1182 (new_nob) is calculated as the 

current number of bits in the data register 1182 (nob) plus the number of bits (nos) 
available for loading from the stripper 1171 in the current cycle, less the number of bits 
(nor) removed from the data register 1182 in the current cycle, which is either a decode 
cycle or a padding bits removal cycle. The new number of bits is calculated as follows: 

25 ncw_nob = nob + nos - nor 

Hie respective arithmetic operations are done in adder 1221 and subtracter 1222. 
It should be noted that (nos) can be 0 if there is no data available from tbe stripper 1171 in 
the current cycle. Also, (nor) can be 0 if there is no decoding done in the current cycle 
because of shortage of bits in the data register 1182, which means there are less bits in the 

30 data register than the sum of th© current code length and the following additional bits 
length as delivered by the control unit 1185. The value (new_nob) may exceed 64 and 
block 1224 checks for this condition. In such a case, the stripper 1171 is stalled and no 
new data is loaded. Multiplexer 1233 is used for zeroing the number bf bits to be loaded 
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from the stripper 1171. A corresponding signal for stalling the stripper 1171 is not 
shown. Signal "padding cycle" driven by decoder 1231 controls multiplexer 1234 to 
select either the number of padding bits or the number of decoded bits (that is the length 
of code bits plus additional bits) as number of bits to be removed (nor). If the number of 

5 the decoded bits is greater than the number (nob) of the bits in the data register, which is 
checked in comparator 1228, the effective number of bits to shift as provided for 
multiplexer 1234 is set to zero by a complex NAND gate 1230. As a result, (nor) is set 
to zero and no bits are removed from the data register. The output of multiplexer 1234 is 
also used to control postshiflers 1182 and 1183. The width of the data register 1182 

10 must be chosen in a way preventing a deadlock situation. This means that at any time 
either there needs to be room in the data register to accommodate the maximum number 
of bits available from the stripper 1171 or sufficient number of valid bits to be removed as 
a result of a decode or a padding of bits removed cycle. 

Calculation of the number of bits to be removed in a decode cycle is performed 

15 by adder 1226. Its operands come from the combinatorial decoding unit 1184. As the 
code length of 16 bits is coded as "0000" by the decoding unit, "orjeduce" logic 1225 
provides encoding of "0000" into "10000", yielding a correct unsigned operand. This 
operand together with the output of subtractor 1227 provide control signals to the output 
formatting shifters 1186 and 1187. 

20 Block 1229 is used for detection of EOl (End Of Image) markeT position. The 

EOI marker itself is removed by the stripper 1171, but there can be some padding bits 
which are the very last bits of the data and which used to precede the EOI marker before 
its removal in the stripper 1171. The comparator 1229 checks if the number of bits in 
the data register 1182, stored in register 1223 is less than eight. If it is, and there is no 

25 more data to come from tie stripper 1171 (that is the data register 1182 holds all the 
remaining bits for of the data unit being decoded), the remaining bits define the size of the 
padding zone before the removed EOI marker. Further handling of the padding zone and 
possible removal of padding bits is identical to the procedure applied in case of padding 
bits before RST markers, which has been described before. 

30 Barrel shifters 1186, 1187 and output formatter 1188 play a support role and 

depending on the embodiment may have a different implementation or may not be 
implemented at all. Control signals to them come from the control unit 1185, as 
described above, the abjneshifter (additional bits preshifter) 1186 takes 32 bits from 
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the data register as input and shifts them to the left by the length of the Huffman code 
being presently decoded. In this way, all the additional bits following the code being 
presently decoded appear left aligned on the output of the barrel shifter 1186 which is also 
the input to the barrel shifter U87. The abjostshifter (additional bits postshifter) 1187 

5 ;.:justs the pos :ioa of the \ ^ al bus from left aligned tr. -ight aligned in an I ' 
field, as used in the output format of the data and shown in Fig. 91- The additional bits 
field extends from bit 8 to bit 18 in the output word format 1196 and some of the, most 
significant bits may be invalid, depending on the actual number of the additional bits. 
This number in encoded on bits 0 to 3 of 11 96, as specified by the JPEG standard. If a 

LO different format of the output data is adopted, the barrel shifters 1186 and 1187 and their 
functionality may change accordingly. 

The output formatter block 1188 packs the decoded values, which in JP£G 
standard are DC and AC coefficients, (1196, bits 0 to 7) and a DC coefficient indicator 
(1196, bit 19) passed by the control unit 1185 together with the additional bits (1196 ; bits 

15 8 to 18) passed by the ab_postshifter 1187 and the marker position bit (1196, bit 23) from 
the marker register 1183 into words according to the format presented in Fig. 91. The 
output formatter 1188 also handles any particular requirements as to the output interface 
of the decoder. The implementation of the output formatter is normally expected to 
change if the output interface changes as a result of different requirements. 

20 The foregoing described Huffman decoder provides a highly effective form of decoding 
providing a high speed decoding operation. 
3.17.8 Image Transformation Instructions 

These instructions implement general afQne transformations of source Images. 
The operation to construct a portion of a transformed image falls generally into two broad 

25 areas. These include firstly working out which parts of the source image are relevant to 
constructing the current output scanline and, it necessary, decompressing them. The 
second step normally comprizes necessary sub-sampling and/or interpolation to construct 
the output image on a pixel by pixel basis. 

Turning to Fig. 92, there is illustrated a flow chart of the steps required 720 to 

30 calculate the value of a destination pixel assuming that the appropriate sections of the 
source image have been decompressed. Firstly, the relevant sub-sampling* if present, 
must be taken into account 721. Next, two processes arc normally implemented, one 
involving interpolation 722 and the other being sub-sampling. Normally interpolation 
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and sub-sampling are alternative steps, however in same circumslances interpolation aod 
sub-sampling may be used together. In the interpolation process, the first step is to find 
the four surrounding pixels 722, then determine if pie-multiplication is required 723, 
before performing bilinear interpolation 724. The bilinear interpolation step 724 is often 
5 computationally intensive and limits the operation of the image transformation process. 
The final step in calculating a destination pixel value is to add together the possibly 
bilinear interpolated sub-samples from the source image. The added together pixel 
values can be accumulated 727 in different possible ways to produce destination image 
pixels of 728. 

10 The instruction word encoding for image transformation instructions is as 

illustrated in Fig. 93 with the following interpretation being placed on the minor opcode 
fields. 



Table 19: Instruction Word - Minor Opcode Fields 



Field 


Description 


S 


0 = bi-lineax interpolation is used on the four 
surrounding source image pixels to determine the 
actually sampled value 

1 = sampled value is snapped to the closest source image 
pixel value 


off[3:0] 


0 = do not apply the offset register (mdp_por) to the 
corresponding channel 

1 s apply the offeet register (mdp_por) to the 
corresponding channel 


P 


0 = do not pre-multiply source image pixels 

1 1= pre-multiply source image pixels 

. , — — 


C 


0 = do not clamp output values 

1 = clamp output underflows to 0x00 and overflows to 




OxFF 


A 


0 a do not take absolute value of output values 

1 = take absolute value of output values before wrapping 
or clamping 
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The instruction operand and result fields are interpreted as follows: 



Table 20: Instruction Operand and Results Word 



Operand 


Description 


Internal 
Format 


External 
Format 


Operand A 


kernel descriptor 




short or long kernel 
descriptor table 


Operand B 


Source Image Pixels 


other 


image table format 


Operand C 


unused 






Result 


pixels 


pixies 


packed stream, unpacked 
bytes 



5 

Operand A points to a data structure known as a "kernel descriptor" that 
describes all the information required to define the actual transformation. This data 
structure has one of two formats (as defined by the L bit in the A descriptor). Fig. 94 
illustrates the long form of kernel descriptor coding and Fig. 95 illustrates the short form 
10 of encoding. The kernel descriptor describes: 

1. Source image start co-ordinates 730 (unsigned fixed point, 24,24 resolution). 
Location (0,0) is at the top left of the image. 

2. Horizontal 731 and vertical 732 (sub-sample) deltas (2's complement fixed point, 
24.24. resolution) . 

15 3. A3 bit bp field 733 defining the location of the binary point within the fixed 
point matrix co-efficients as described hereinafter. 

4. Accumulation matrix co-efficients 735 (if present). These are of •Variable" 
point resolution of 20 binary places (2's complement), with the location of the binary 
point implicitly specified by the bp field. 
20 5. An rl field 736 that indicates the remaining number of words in the kernel 
descriptor. This value is equal to the number of rows times the number of columns 
minus 1. 




( 401 ) 



4$gff¥l 1 -8 5 9 6 9 



- 130 - 

The kernel co-efficients in the descriptor are listed row by row, with elements of 
alternate rows listed in reverse direction, thereby forming a zig zag pattern. 

Turning now to Fig. 96, the operand B consists of a pointer to an index table 
indexing into scan lines of a source image. The structure of the index table is as 
5 illustrated in Fig. 96, with the operand B 740 pointing to an index table 741 which in turn 
points to scan lines (eg. 742) of the required source image pixels. Typically, the index 
table and the source image pixels are cacheable and possibly located in the local memory. 

The operand C stores the horizontal and vertical sub-sample rate. The 
horizontal and vertical sub-sample rates are defined by the dimensions of the sub-sample 
10 weight matrix which are specified if the C descriptor is present. The dimensions of the 
matrix r and c are encoded in the data word of the image transformation instruction as 
illustrated in Fig. 97. 

Channel N of a resultant pixel P[N] is calculated in accordance with the 
following equation: 

15 

p[n] = (/.o/fee(n]-mdp par :0000) + 2 2 ^ ^x+rAx,y + cAy)[n] 

r c 

Internally, the accumulated value is kept to 36 binary places per channel. The location of the 
binary point within this field is spcciFied by the BP field. The BP field indicates the number of leading bits 
20 in the accumulated result to discard. The 36 bit accumulated value is treated as a signed 21s compliment 
number and is clamped or wrapped as specified. In Fig. 98, there ib illustrated an example of the 
interpretation of the BP field In co-efficient encoding. 

3,17.9 Convolution Instructions 

Convolutions, as applied to rendering images, involves applying a two 
25 dimensional convolution kernel to a source image to produce a resultant image. 
Convolving is normally used fox such matters as edgp sharpening or indeed any image 
filter. Convolutions are implemented by the co-processor 224 in a similar manner to 
image transformations with the difference being that, in the case of transformations the 
kernel is translated by the width of the kernel for each output pixel, in the case of 
30 convolutions* the kernel is moved by one source pixel for each output pixel. 

If a source image has values S(x,y) and a n x m convolution kernel has values 
C(x,y), then the nth channel of the convolution H[n] of S and C is given by: 



( 402 ) 



1-85969 



- 131 - 

where i € [0,c] and j G [0,r]. 

The interpretation of the offset value, the resolution of intermediate results and the interpretation 
5 of the bp field are the same as for Image Transformation instructions. 

In Fig. 99, there is illustrated an example of how a convolution kernel 750 is applied to a source 
image 751 to produce a resultant image 752. Source image address generation and output pixel 
calculations are performed in a similar manner to that for image transformation instructions. The 
instruction operands take a similar form to image transformations. In Fig. 100, there is illustrated the 
10 instruction word encoding for convolution instructions with the following interpretation being applied to the 
various fields. 



Table 21: Instruction Word 



1 Field 


Description 


S 


0 = bi-linear interpolation is used on the four surrounding source 
image pixels to determine the actually sampled value 

1 = sampled value is snapped to the closest source image pixel 
value 


C 


0 = do not clamp resultant vector values 

1 = clamp result vector values: underflow to 0x00, overflow to 
OxFF 


P 


0 = do not pre-multipiy input pixels 

1 = pre multiply input pixels 


A 


0 = do not take absolute value of output values 

1 » take absolute value of output values before wrapping or 

clamping 


off[3:0] 


0 = do not apply the offset register to this channel 

1 = apply the offset register to this channel 
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3.17.10 Matrix Multiplication 

Matrix multiplication is utilized for many things including being utilized for 
color space conversion where an affine relationship exists between two color spaces. 
Matrix multiplication is defined by the following equation: 



hi 



"0,3 

b W \\ \i hi b \s 

b 2.0 b 2,\ hi b 2A 

b W fr 3,l hi b W fc 3,4 



The matrix multiplication instruction operands and resuLts have the following 

format: 



Table 22: Instruction Operand and Results Word 



Operand 


Description 


Internal 
Format 


External 
Format 


Operand A 


source image pixels 


pixels 


packed stream 


Operands 


matrix co-efficients 


other 


image table format 


Operand C 


unused 






Result 


pixels 


pixels 


packed stream, 
unpacked bytes 



The instruction word encoding for matrix multiplication instructions as 
illustrated m Fig. 101 with the following table summarising the minor opcode Gelds. 



Field 



Table 23: Instruction Word 



Description 



0 = do not clamp resultant vector values. 

1 = clamp resultant vector values: underflow to 0x00, 
overflow to OxFF 
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p 


0 = do not pre-multiply input pixels 

1 = pre-multiply input pixels 


A 


0 = do not take absolute value of output values 

1 = take absolute value of output values before wrapping or 
clamping 



3.17.11 Halftoning 

The co-piocessor 224 implements a multi-level dither for halftoning. Anything 

from 2 to 255 is a meaningful number of halftone levels. Data to be halftoned can be 
5 either bytes (ie. unmeshed or one channel from meshed data) or pixels (ie. meshed) as 

long as the screen is correspondingly meshed or unmeshed. Up to four output channels 

(or four bytes from the same channel) can be produced per clock, either packed bits (for 

bi-level halftoning) or codes (for more than two output levels) .which are either packed 

together in bytes or unpacked in one code per bye. 
I o The output half-toned value is calculated using the following formula: 

(px(!-l) + d)/255 

Where p is the pixel value (0<p<255), 1 is the number of levels (2<1<255) and d 
is the dither matrix value (Q<d<254), The operand encoding is as follows: 
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Table 24: Instruction Operand and Results Word 



Operand 


Description 


Internal 
Format 


External 
Format 


Operand A 


source image 
pixels 


pixels 


packed stream 




source image bytes 


packed bytes, 
unpacked bytes 


packed stream 


Operand B 


dither matrix co- 
efficients 


pixels, packed 
bytes, unpacked 
bytes 


packed stream, 
unpacked bytes 


Operand C 


unused 






Result 


halftone codes 


pixels, packed bytes 
unpacked bytes 


packed stream, 
unpacked bytes 



In the instruction word encoding, the minor op code specifies a number of 
halftone levels. The operand B encoding is for the halftone screen and is encoded in the 
5 same way as a compositing tile. 

3.17.12 Hlerarchial Image Format Decompression 

Hierarchial image format decompression involves several stages. These stages 
include horizontal interpolation, vertical interpolation, Huffman decoding and residual 
merging. Each phase is a separate instruction. In the Huffman decoding step, the 
10 residual values to be added to the interpolated values from the interpolation steps are 
Huffman coded. Hence, the JPEG decoder is utilized for Huffman decoding. 

In Fig. 102, there is illustrated the process of horizontal interpolation. The 
output stream 761 consists of twice as much data as the input stream 762 with the last 
data value 763 being replicated 764. Fig. 103 illustrates horizontal interpolation by a 
15 factor of 4. 

In the second phase of hierarchial Image format decompression, rows of pixels 
are up sampled by a factor of two or four vertically by linear interpolation. During this 
phase, one row of pixels is on operand A and the other row is on operand B. 

When vertically interpolating, either by a factor of two or four, the output data 



20 stream contains the same number of pixels as each input stream. In Fig. 104, there is 
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illustrated an example of vertical interpolation wherein two input data streams 770, 771 
are utilized to produce a first output stream 772 having a factor of two interpolation or a 
second output stream 773 having a factor of 4 interpolation. In the case of pixel 
interpolation, interpolation occuis separately on each of the four channels of four channel 

pixels. 

The residual merging process involves the bytewize addition of two streams of 
data- The first stream (operand A) is a stream of base values and the second stream 
(operand B) is a stream of residual values. 

In Fig. 105, there is illustrated two Input streams 780, 781 and a corresponding 
output stream 782 for utilising the process of residual merging. 

In Fig. 106 there is illustrated the instruction word encoding for hierarchial 
image format instructions with the following table providing the relevant details of the 
minor op code fields. 



Table 25: Instruction Word - Minor Opcode Fields 



If Field 


Description 


R 


0 = interpolation 

1 = residual merging 


1 V 


0 = horizontal interpolation II 

1 s vertical interpolation 


If 


0 = interpolate by a factor of 2 

1 = interpolate by a factor of 4 


c 


0 = do not damp resultant values 

1 = clamp resultant values: underflow to 0x00, overflow to 
OaePF 



3.17.13 Memory Copy Instructions 

These instructions are divided into two specifically disjointed groups, 
a. General purpose data movement instructions 

These instructions utilize the normal data flow path through the co-processor 224, 
comprising the input interface module, input interface switch 252, pixel organizer 246, 
JPEG coder 241, result organizer 249 and then the output interface module. In this case, 
the JPEG coder module sends data straight through without applying any operation* 
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Other instructions include data manipulation operations including: 

* packing and unpacking sub -byte values (such as bits, two bit values and 
four bit values) to a byte 

• packing and unpacking bytes within a word 
5 • aligning 

• meshing and unmeshing 

• byte lane swapping and duplicating 

* memory clearing 

* replicating values 

10 The data manipulation operation is carried out by a combination of the pixel 

organizer (on input) and the result organizer (on output). In many cases, these 
instructions can be combined with other instructions, 
b. Local DMA instructions 

No data manipulation takes place. As seen in Fig. 2 data transfer occurs (in 

15 either direction) between the Local Memory 236 and the Peripheral Interface 237. These 
instructions are the only ones for which execution can be overlapped with some other 
instruction. A maximum of one of these instructions can execute simultaneously with a 
"non overlapped" instruction. 

In memory copy instructions, operand A represents the data to be copied and the 

20 result operand represents the target address of the memory copy instructions. For 
general purpose memory copy instructions, the particular data manipulation operation is 
specified by the operand B for input and operand C for output operand words. 
3.17.14 Flow Control Instructions 

The flow control instructions are a family of instructions that provide control 

25 over various aspect of the instruction execution model as described with reference to Fig. 
9. The flow control instructions include both conditional and unconditional jumps 
enabling the movement from one virtual address to another when executing a stream of 
instructions. A conditional jump instruction is determined by taking a co-processor or 
register, masking off any relevant fields and comparing it to given value. Tins provides 

30 for reasonable generality of instructions. Further, flow control instructions include wait 
instructions which are typically used to synchronize between overlapped and non- 
overlapped instructions or as part of micro-programming. 
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In Fig. 107, there is illustrated instruction when encoding for flow control 
instructions with the minor opcodes being interpreted as follows: 



Table 26: Instruction Word - Minor Opcode Fields 



Field 


Description 


type 


00 = jump 

01 = wait 


C 


0 = unconditional jump 

1 = condition jump 


s 


0 = use Operand B as Condition Register and 
Operand C as Condition mask 

1 = any interrupt condition set 


N 


0 = jump if condition is true 

1 = dont jump if condition is true 


O 


0 - wait on non-overlapped instruction to finish 

1 = wait on overlapped instruction to finish 



5 

In respect of Jump Instructions, the operand A word specified the target address 
of (he jump instruction. If the S bit of the Minor Opcode is set to 0, then operand B 
specified a co-processor register to use as the source of the condition. The value of the 
operand B descriptor specifies the address of the register, and the value of the operand B 
10 word defines a value to compare the contents of the register against. The operand C 
word specifies a bitwize mask to apply to the result. That is, the Jump Instruction's 
condition is true of the bitwize operation: 

({(regisler_value xor Operand B) and Operand C) => 0x00000000) 
Further instructions are also provided for accessing registers for providing full 
15 control at the micro programmed level. 
3.18 Modules of the Accelerator Oud 

Turning again to Kg. 2. there wiH now be provided further separate description 
of the various modules. 
3.18.1 Pixel Organizer 
20 The pixel organizer 246 addresses and buffers data streams from the input 

interface switch 252. The input data is stored in the pixel organizer's internal memory or 
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buffered to the MUV buffer 250. Any necessary data manipulation is performed upon 
the input stream before it is delivered to the main data path 242 or JPEG coder 241 as 
required, The operating modes of the pixel organizer are configurable by the usual CBus 
interface. The pixel organizer 246 operates in one of five modes, as specified by a 
5 PO_CFG control register. These modes include: 

(a) Tdlc Mode - where the pixel organizer 246 is not performing any operations. 

(b) Sequential Mode - when input data is stored in an internal FIFO and trie pixel 
organizer 246 sends out requests for data to the input interface switch 252, generating 32 
bit addresses for this data. 

10 (c) Color Space Conversion Mode - when the pixel organizer buffers pixels for color 
space conversion. In addition, requests are made for interval and fractional values stored 
in the MUV buffer 250, 

(d) JPEG Compression Mode - when the pixel organizer 246 utilizes the MUV 
buffer to buffer image data in the form of MClFs. 
15 (e) Convolution and Image Transformation Mode - when the pixel organizer 246 
stores matrix co-efficicnta in the MUV buffer 250 and passes them, as necessary, to the 
main data path 242. 

The MUV buffer 250 is therefore utilized by the pixel organizer 246 for both 
main data path 242 and JFEG coder 241 operations. During color space conversion, the 
20 MUV RAM 250 stores the interval and fractional tables and they arc accessed as 36 bits 
of data (four color channels) x (4 bit interval values and 8 bit fractional values). For 
image transformation and convolution, the MUV RAM 250 stores matrix co-efficients 
and related configuration data. The co-efficient matrix is limited to 16 rows x 16 
columns with each co-efficient being at a maximum 20 bits wide. Only one co-efficient 
25 per clock cycle is required from the MUV RAM 250. In addition to co-cfficicnt data, 
control information such as binary point, source start coordinates and sub-sample deltas 
must be passed to the main data path 242. This control information is fetched by the 
pixel organizer 246 before any of the matrix co-efficients are fetched. 

During JPEG compression, the MUV buffer 250 is utilized by Ihe pixel organizer 
30 246 to double buffer MCU ! s. Preferrably, the technique of double buffering is employed 
to increase the performance of JPEG compression. One half of the MUV RAM 250 is 
written to using data from the input interface switch 252 while the other half is read by the 
{rixel organizer to obtain data to send to the JPEG coder 241. The pixel organizer 246 is 
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also responsible for performing horizontal sub-sampling of color components where 

required and to pad MCLTs where an input, image does not have a size equal to an exact 
integral number of MCUs. 

The pixel organizer 246 is also responsible for formatting input data including 
5 byte lane swapping, normalization, byte substitution, byte packing and unpacking and 
replication operations as hereinbefore discussed with reference to Fig. 32 of the 
accompanying drawings. The operations are carried out as required by setting the pixel 
organizers registers. 

Turning now to Fig. 108, there is shown the pixel organizer 246 in more detail. 
10 The pixel organizer 246 operates under the control of its own set of registers contained 
within a CBus interface controller 801 which is interconnected to the instruction 
controller 235 via the global CBus, The pixel organizer 246 includes an operand fetch 
unit 802 responsible for generating requests from the input interface switch 252 for 
operand data needed by the pixel organizer 246. The start address for operand data is 

15 given by the PO_SAID register which must be set immediately before execution. The 
PO_SAID register may also hold immediate data, as specified by the L bit in the 
PO_DMR register. The current address pointer in stored in the PO_CDP register and is 
incremented by the burst length of any input interface switch request. When data is 
fetched into the MUV RAM 250, the current offset for data is concatenated with a base 

20 address for the MUV RAM 250 as given by the PL_MU V register. 

A FIFO 803 is utilized to buffer sequential input data fetched by the operand 
fetch unil 802. The data manipulation unit 804 is responsible for implementing for 
implementing the various manipulations as described with reference to Fig. 32. Hie 
output of the data manipulation unit is passed to the MUV address generator 805 which is 

25 itsrxmsiblc for passing data to the MUV RAM 250, main data path 242 or JPEG coder 
241 in accordance with configuration registers, A pixel organizer control unit 806 is a 
state machine that generates the required control signals for all the sub-modules in the 
pixel organizer 246. Included in these signals are those for controlling communication 
on the various Bus interfaces. The pixel organizer control unit outputs diagnostic 

30 information as required to the miscellaneous module 239 according to its status register 
settings. 

Turning now to Fig. 109, there is illustrated the operand fetch unit 802 of 
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generator (IAG) 810 which contains a state machine for generating requests to fetch 
operand data. These requests are sent to a request arbiter 811 which arbitiates between 
requests from the address generator 810 and those from the MUV address generator 805 
(Fig. 108) and sends the winning requests to the input (MAG) interface switch 252. The 
5 request arbiter 811 contains a state machine to handle requests. It monitors the state of 
the FIFO via FIFO count unit 814 to decide when it should dispatch the next request. A 
byte enable generator 812 takes information on the IAG 810 and generates byte enable 
patterns 816 specifying the valid bytes within each operand data word returned by the 
input interface switch 252. The byte enabled pattern is stored along with the associated 

10 operand data in the FIFO. The request arbiter 811 handles MAG requests before IAG 
requests when both requests arrive at the same time. 

Returning to Fig. 108, the MUV address generator 805 operates in a number of 
different modes. A first of these modes is the JPEG (compression) mode. In this mode, 
input data for JPEG compression is supplied by the data manipulation units 804 with the 

15 MUV buffer 250 being utilized as a double buffer. The MUV RAM 250 address 
generator 805 is responsible for generating the right addresses to the MUV buffer to store 
incoming data processed by the data manipulation unit 804. The MAG 805 is also 
responsible for generating read addresses to retrieve color component data from the stored 
pixels to form 8x8 blocks for JPEG compression. The MAG 805 is also responsible for 

20 dealing with the situation when a MCU lies partially on the image. In Fig. 110, there is 
illustrated an example of a padding operation carried out by the MAG 805. 

For normal pixel data, the MAG 805 stores the four color components at the 
same address within the MUV RAM 250 in four 8 bit rams. To facilitate retrieval of 
data from the same color channel simultaneously* the MCU data is barrel shifted to the 

25 left before it is stored in the MUV RAM 250. The number of bytes the data is shifted to 
the left is determined by the lowest two bits of the write address. For example, in Fig. 
Ill there is illustrated the data organization within the MUV RAM 250 for 32 bit pixel 
data when no sub-sampling is needed. Sub-sampling of input data maybe selected for 
three or four channel interleaved JPEG mode. In multichannel JPEG compression mode 

30 with subsampling operating, the MAG 805 (Fig. 108) performs the sub-sampling before 
the 32 bit data is stored in the MUV RAM 250 for optimal JPEG coder performance. 
For the first four incoming pixels, only the fust and fourth channels stored in the MUV 
kAM 250 contains useful data. The data in the second and third channel is sub-sampled 
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and stored in a register inside the pixel organizer 246. For the next four incoming pixels, 
the second and third channel are filled with sub-sampled data. In Fig. 112, there is 
illustrated an example of MCU data organization for multi-channel sub-sampling mode. 
The MAG treats all single channel unpacked data exactly the same as multi-channel pixel 
data. An example of single channel packed data as read from the MUV RAM is 

illustrated in Fig. 113. 

While the writing process is storing an incoming MCU into the MUV RAM, the 
reading process is reading 8x8 blockB out of the MUV RAM. In general, the blocks are 
generated by the MAG 805 by reading the data for each channel sequentially, four co- 
efficients at the time. For pixel data and unpacked input data, the stored data is 
organized as illustrated in Fig. 111. Therefore, to compose one 8 x 8 block of non- 
sampled pixel data, the reading process reads data diagonally from the MUV RAM. An 
example of this process is illustrated in Fig. 114, which shows the reading sequence for 
four channel data, the form of storage in the MUV RAM 250 assisting to read multiple 
values for the same channel simultaneously. 

When operating in color conversion mode, the MUV RAM 2S0 is used as a 
cache to hold the interval and fractional values and the MAG 805 operates as a cache 
controller. The MUV RAM 250 caches values for three color channels with each color 
channel containing 256 pairs of four bit interval and fractional values. For each pixel 
output via the DMU, the MAG 805 is utilized to get the values from the MUV RAM 250. 
Where the value is not available, the MAG 805 generates a memory read request to fetch 
the missing interval and fractional values. Instead of fetching one entry in each request, 
multiple entries are fetched simnltaneously for better utilization of bandwidth. 

For image transformation and convolution, the MUV RAM 250 stores the matrix 
crxsffidents for the MDP. The MAO cycles through all the matrix co-efficient stored in 
the MUV RAM 250. At the start of an Image transformation and convolution instruction, 
the MAG 805 generates a request to the operand fetch unit to fetch the kenial description 
"header" (Fig. 94) and the first matrix co-efficient in a burst request 

Turning now to Fig. 115, there is illustrated the MTJV address generator (MAG) 
805 of Fig. 108 in more detail. The MAG 805 includes an IBus request module 820 
which multiplexers IBus requests generated by an image transibrmation controller (TTX) 
821 and a color space conversion (CSC) controller 822. The requests are sent to the 
operand fetch unit which services the request. The pixel organizer 246 is only operated 
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either in image transformation or color space conversion mode. Hence, there is no 
arbitration required between the two controllers 821, 822. The IBus request module 820 
derives the information for generating a request to the operand fetch unit including the 
burst address and burst length from the relevant pixel organizer registers. 
5 A JPEG controller 824 is utilized when operating in JPEG mode and comprizes 

two state machines being a JPEG write controller and a JPEG read controller. The two 
controllers operate simultaneously and synchronize with each other through the use of 
internal registers. 

In a JPEG compression operation, the DMU outputs the MCU data which is 
10 stored into the MUV RAM. The JPEG Write Controller is responsible for horizontal 
padding and control of pixel subsampling, while the JPEG Read Controller is responsible 
for vertical padding. Horizontal padding is achieved by stalling the DMU output, and 
vertical padding is achieved by reading the previously read 8x8 block line. 

The JPEG Write Controller keeps track of the position of the cuirent MCU and 
15 DMU output pixel on the source image, and uses this information to decide when the 
DMU has to be stalled for horizontal padding. When a MCU has been written into the 
MUV RAM 250, the JPEG Write Controller sets/resets a set of internal registers which 
indicates the MCU is on the right edge of the image, or is at the bottom edge of the image. 
The JPEG Read Controller then uses the content of these registers to decide if it is 
20 required to perform vertical padding, and if it has read the last MCU on the image. 

The JPEG Write Controller keeps track of DMU output data, and stores the 
DMU output data into the MUV RAM 250. 

The controller uses a set of registers to record the current position of the input 
pixel. This information is used to perform horizontally padding by stalling the DMU 

25 OUtpUt. 

When a complete MCU lias been written into the MUV RAM 250, the controller 
writes the MCU information into JPEG-RW-1PC registers which is later used by the 
JPEG Read Controller. 

The controller enters the SLEEP state after the last MCU has been written into 
30 the MUV RAM 250, The controller stays in this state until the current instruction 
completes. 
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The JPEG Read Controller read the 8x8 blocks from the MCUs stored in the 
MUV RAM 250. For multi-channel pixels, the controller reads the MCU several times, 
each time extracting a different byte from each pixel stored in the MUV RAM. 

The controller detects if it needs to perform vertical padding using the 
5 information provided by the JPEG-RW-IPC. Vertical padding is achieved by re-reading 
the last 8-bytes read from the MUV RAM 250. 

The Image Transformation Controller 821 is responsible for reading the kernel 
discriptor from the IBus and passes the kernel header to the MDP 242, and cycles through 
the matrix co-efficients as many times as specified in the po.len register. All data output 
io by the PO 246 in an image transformation and Convolution instruction are fetched 
directly from the IBus and not passed through the DMU. 

The top eight bits of the first matrix co-efficient fetched immediately after the 
kernel header contains the number of remaining matrix co-efficients to be fetched 

The kernel header is passed to the MDP directly without modifications, whilst 
15 the matrix co -efficients are sign extended before they are passed to the MDP. 

The pixel sub-sampler 825 comprizes two identical channel sub-samplers, each 
operating on a byte from the inpul word. When the relevant configuration register is not 
asserted, the pixel sub-sampler copies its input to its output. When the configuration 
register is asserted, the sub-sampler sub-samples the input data either by taking the 
20 average or by decimation. 

An MUV multiplexer module 826 selects the MUV read and write signals from 
the currently active controller. Internal multiplexers are used to select the read addresses 
output via the various controllers that utilize the MUV RAM 250. An MUV RAM write 
address is held in an 8 bit register in an MUV multiplexer module. The controllers 
25 utilising the MUV RAM 250, load the write address register in addition to providing 
control for determining a next MUV RAM address. 

A MUV valid access module 827 is utilized by the color space conversion 
controller to determine if the interval and fractional values for a current pixel output by 
the data manipulation unit is available in the MUV RAM 250. When one or more color 
30 channels are missing, the MUV valid access module 827 passes the relevant address to the 
IBus request module 820 for loading in burst mode, interval and fractional values. Upon 
servicing a cache miss, the MUV valid access module 827 sets internal validity bits which 
map the set of Interval and fractional values fetched so far. 
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A replicate module 829 replicates the incoming data, the number of times as 
specified by an internal pixel register. The input stream is stalled while the replication 
module is replicating the current input word. A PBus interface module 630 is utilized to 
re-time the output signals of the pixel organizer 246 to the main data path 242 and JPEG 

5 coder 241 and vice versa. Finally, a MAG controller 831 generates signals for initiating 
and shutting down the various sub-modules. It also performs multiplexing of incoming 
PBus signals from the main data path 242 and JPEG coder 241. 
3.18.2 MUV Buffer 

Returning to Fig. 2, it will be evident from the foregoing discussion that the pixel 

10 organizer 246 interacts with the MUV buffer 250. 

The reconfigurable MUV buffo: 250 is able to support a number of operating 
modes including the single lookup table mode (modeO), multiple lookup table mode 
(model), and JPEG mode (mode2). A different type of data object is stored in the buffer 
in each mode. For instance, the data objects that are stored in the buffer can be data 

15 words, values of a multiplicity of lookup tables, single channel data and multiple channel 
pixel data. In general, the data objects can have different sizes. Furthermore, the data 
objects stored in the reconfigurable MUV buffer 250 can be accessed in substantially 
different ways which is dependent on the operating mode of the buffer. 

To facilitate the different methods needed to store and retrieve different types of 

20 data objects, the data objects are often encoded before they are stored. The coding 
scheme applied to a data object is determined by the size of the data object, the format 
that the data objects are to be presented, how the data objects are retrieved from the buffer, 
and also the organization of the memory modules that comprize the buffer. 

Fig. 116 is a block diagram of the components used to implement the 

25 reconfigurable MUV buffer 250. The reconfigurable MUV buffer 250 comprizes an 
encoder 1290, a storage device 1293, a decoder 1291, and a read address and rotate signal 
generator 1292. When a data object arrives from an input data stream 1295, the data 
object may be encoded into an internal data format and placed on the encoded input data 
stream 1296 by the encoder 1290. The encoded data object is stored in the storage 

30 device 1293. 



When decoding previously stored data objects, an encoded data object is read out 
of the storage device via encoded output data stream 1297. The encoded data object tn 
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the encoded output data stream 1297 is decoded by a decoder 1291. The decoded data 
objsct is then presented at the output data stream 1298. 

The write addresses 1305 to the storage device 1293 are provided by the MAG 
805 (Fig 108). The read addresses 1299, 1300 and 1301 are also provided by the MAG 
5 805 (Fig 108), and translated and multiplexed to the storage device 1293 by the Read 
Address and Rotate Signal Generator 1292, which also generates input and output rotate 
control signals 1303 and 1304 to the encoder and decoder respectively. The write enable 
signals 1306 and 1307 axe provided by an external source. An operating mode signal 
1302, which is provided by means of the controller 801(Fig 108), is connected to the 
10 encoder 1290, the decoder 1291, the Read Address and Rotate Signal Generator 1292, and 
the storage device 1293, An increment signal 1308 increments internal counters) in the 
read address and rotate signal generator and may be utilized in JPEG mode (mode2). 

Preferably, when the reoonfigurable MUV buffer 250 is operating in the single 
lookup table mode (modeO), the buffer behaves substantially like a single memory module. 
15 Data objects may be stored into and retrieved from the buffer in substantially the same 
way used to access memory modules. 

When the reconfigurable MUV buffer 250 is operating in the multiple lookup 
table mode (mode 1), the buffer 250 is divided into a plurality of tables with up to three 
lookup tables may be stored in the storage device 1293. The lookup tables may be 
20 accessed separately and simultaneously. For instance* in one example, interval and 
fraction values are stored in the storage device 1293 in the multiple lookup table mode, 
and the tables are indexed utilizing the lower three bytes of the input data stream 1295. 
Each of the three bytes are issued to access a separate lookup table stored in the storage 
device 1293. 

25 When an image undergoes JPEG compression, the imago is converted into an 

encoded data stream. The pixels are retrieved in the form of MCUs from the original 
image. The MCUs are read from left to right, and top to bottom from the image. Each 
MCU is decomposed into a number of single component 8x8 blocks. The number of 
8x8 blocks that can he extracted from a MCU depends on several factors including: the 

30 number of color components in the source pixels, and for a multiple channel JPEG mode, 
whether subsampling is needed. The 8x8 blocks are then subjected to forward DCT 
(FDCT), quantization, and entropy encoding. In the case of JPEG decompression, the 
encoded data are read sequentially from a data stream. The data stream undergoes 
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entropy decoding, dequantization and inverse DCT (DDCT). The output of the DDCT 
operation are 8x8 blocks. A number of single component 8x8 blocks are combined to 
reconstruct a MCU. As with JPEG compression, the number of single component 8x8 
blocks are dependent on the same factors mentioned above. The reconfigurable MUV 
5 buffer 250 may be used in the process to decompose MCUs into a multiplicity of single 
component 8x8 blocks, to reconstruct MCUs from a multiplicity of single component 8x8 
blocks. 

When the reconfigurable MUV buffer 250 is operating in JPEG mode (mode2), 
the input data stream 1295 to the buffer 250 comprizes pixels for a JPEG compression 

10 operation, or single component data in a JPEG decompression operation. The output 
data stream 1298 of the buffer 250 comprizes single channel data blocks for a JPEG 
compression operation, or pixel data in a JPEG decompression operation. In this example, 
for a JPEG compression operation, an input pixel may comprize up to four channels 
denoted Y, U, V and O. When the required number of pixels have been accumulated in 

15 the buffer to form a complete pixel block, the extraction of single component data blocks 
can commence. Each single component data block comprizes data from the Like channel 
of each pixel stored in the buffer. Thus in this example, up to four single component 
data blocks may be extracted from one pixel data block. In this embodiment, when the 
reconfigurable MUV buffer 250 is operating in the JPEG mode (mode2) for JPEG 

20 compression, a multiplicity of Minimum Coded Units (MCUs) each containing 64 single 
or 64 multiple channel pixels may be stored in the buffer, and a multiplicity of 64-byte 
long single channel component data blocks are extracted from each MCU stored in the 
buffer In this embodiment, for the buffer 1289 operating in the JPEG mode (mode2) for a 
JPEG decompression operations, the output data stream contains output pixels that have 

25 up to four components Y, U, V and O. When the required number of complete single 
component data blocks have been written into the buffer, the extraction of pixel data may 
commence. A byte from up to four single component block corresponding to different 
color components are retrieved to form an output pixel. 

Fig. 117 illustrates the encoder 1290 of Fig. 1 16 in more detail For the pixel 

30 block decomposition mode only, each input data object is encoded using a byte-wize 
rotation before it is stored into the storage device 1293 (Fig. 129). The amount of 
rotation is specified by the input rotate control signal 1303. As the pixel data has a 
maximum of four bytes in this example, a 32-bit 4-to-l multiplexer 1320 and output 1325 
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is used to select one of the four possible rotated versions of the input pixel. For example, 
if the four bytes in s pixel are labelled (3,2,1,0), the four possible rotated versions of ibis 
pixel are (3,2,1,0), (0,3,2,1), (1,0,3,2) and (2,1,0,3). The four encoded bytes are output 
1296 for storage in the storage device. 
5 When the buffer is placed in an operating mode other than the JPEG mode 

(mode2), for example, single lookup table mode (inodeO) and multiple lookup table mode 
(rrrjdel). hyte-wize rotation may not be necessary and may not be performed on the input 
data objects. The input data object is prevented from being rotated in the latter cases by 
overriding the input rotate control signal with a no-operation value. This value 1323 can 

10 be zero. A 2-to-l multiplexer 1321 produces control signals 1326 by selecting between 
the input rotate control signal 1303 and the no-operation value 1323. The current 
operating mode 1302 is compared with the value assigned to the pixel block 
decomposition mode to produce the multiplexer select signal 1322. The 4-to-l 
multiplexer 1320, which is controlled by signal 1326 selects one of the four rotated 

15 version of the input data object on the input data stream 1325, and produces an encoded 
input data object on the encoded input data stream 1326. 

Fig. 118 illustrates a schematic of a combinatorial circuit which implements the 
decoder 1291 for the decoding of the encoded output data stream 1297. The decoder 
1321 operates in a substantially similar manner to the encoder. The decoder only 

20 operates on the data when the data buffer is in the JPEG mode (xnode2). The lower 32- 
bit of an encoded output data object in the encoded output data stream 1297 is passed to 
the decoder. The data is decoded using a byte-wize rotation with an opposite sense of 
rotation to the rotation performed by the encoder 1290. A 32-bit 4-to-l multiplexer 
1330 is used to select one of the four possible rotated version of the encoded data* For 

25 example* if the four bytes in an input pixel are labelled (3,2,1,0), the four possible rotated 
version of this pixel are (3,2,1,0), (2,1,0,3), (1,0,3,2) and (0,3,2,1)- The output rotate 
control signal 1304 is utilized only when the buffer is in a pixel block decomposition 
mode, and when overridden by a no-operation value in other operating modes* The no 
operation value utilized 1333 is zero. A 2-to-l multiplexer 1331 produces signal 1334 

30 by selecting selects between the output rotate control signal 1304 and the no-operation 
value 1333* The current operating mode 1302 is compared with the value assigned to 
the pixel block decomposition mode to produce the multiplexer select signal 1332. The 
4-to-l multiplexer 1330, which is controlled by signal 1334, selects one of the four 
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rotated version of the encoded output data object on the encoded output data stream 1297, 
and produces an output data object on the output data stream 1298. 

Returning to Fig. 116, the method of internal read address generation used by the 
circuit is selected by the operating mode 1302 of the reconfigurable MUV buffer 250. 
For the single lookup table mode (modeO) and multiple lookup table mode (model), the 
read addresses are provided by the MAG 805 (Fig 108) in the form of external read 
addresses 1299, 1300, and 1301. For the single lookup table mode (modeO), the memory 
modules 1380, 1381, 1382, 1383, 1384 and 1385 (Fig. 121) of the storage device 1293 
operate together. The read address and the write address supplied to the memory 
modules 1380 to 1385 (Fig. 121) are substantially the same. Hence the storage device 
1293 only needs the external circuits to supply one read address and one write address, 
and uses internal logic to multiplex these addresses to the memory modules 1380 to 1385 
(Fig. 121). For modeO, the read address is supplied by the external read address 1299 
(Fig. 116) and is multiplexed to the internal read address 1348 (Fig. 121) without 
substantial changes. The external read addresses 1300 and 1301 (Fig. 116), and the 
internal read addresses 1349, 1350 and 1351 (Fig. 121), are not used in modeO. The 
write address is supplied by the external write address 1305 (Fig. 116), and is connected 
to the write address of each memory module 1380 to 1385 (Fig. 121) without substantial 
modification. 

In this example, a design that provides three lookup tables in the multiple lookup 
table mode (mode 1) is presented, the encoded input data is written simultaneously into 
all memory modules 1380 to 1385 (Fig, 121), while the three tables are accessed 
independently, and thus require one index to each of the three tables. Three indices, that 
is, read addresses to the memory modules 1380 to 1385 (Fig. 121), are supplied to the 
storage device 1293. These read addresses are multiplexed to the appropriate memory 
modules 1380 to 1385 using internal logic. In substantially the same manner as in the 
singje lookup table mode, the write address supplied externally is connected to the write 
address of each of the memory modules 1380 to 1385 without substantial modifications. 
Hence, for the multiple lookup table mode (mode 1), the external read addresses 1299, 
1300 and 1311 are multiplexed to internal read addresses 1348, 1349 and 1350 
respectively. The internal read address 1351 is not used in mode L The method of 
generating the internal read addresses need in the JPEG mode (mode 2) is different to the 
method described above. 
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Fig. 119 illustrates a schematic of a combinatorial circuit which implements the 
read address and rotate control s: : mals generation circuit 1292 (Fig. 116). for the 
reconfigurable data buffer operating in the JPEG mode (mode 2) for JPEG compression. 
In the JPEG mode (mode 2), the generator 1292 uses the output of a component block 
5 counter 1340 and the output of a data byte counter 1341 to compute 'he irr.Ci,^ i-rad 
addresses to the memory modules comprising the storage device 1293. The component 
block couator 1340 gi.vs?; the number comf \icnl !: r -:„!:s exTacr*'? fxc:n r 
block, which is stored in the storage device. The number of like components extracted 
from the pixel data block is given by multiplying the output of the data byte counter 1341 

10 by four. In this embodiment, an internal read address 1348, 1349, 1350 or 1351 for the 
pixel data block decomposition mode is computed as follows. The output of the 
component block counter is used to generate an offset value 1343, 1344, 1345, 1346 or 
1347, and the output of the data byte counter 1341 is used to generate a base read address 
1354. The offset value 1343 is added 1358 to the base read address 1354 and the sum is 

15 an internal read address 1348 (or 1349, 1350 or 1351). The offset values for the memory 
modules are in general different for simultaneous read operations performed on multiple 
memory modules, but the offset value to each memory module is in general substantially 
the same during the extraction of one component data block. The base addresses 1354 
used to compute the four internal read addresses in the pixel data block decomposition 

20 mode are substantially the same. The increment signal 1308 is used as the component 
byte counter increment signal- The counter is incremented after every successful read 
operation has been performed. A component block counter increment signal 1356 is 
used to increment the component block counter 1340, after a complete single component 
data block has been retrieved from the buffer. 

25 The output rotate control signal 1304 (Fig. 116) is derived from the output of the 

component block counter, and the output of the data byte counter, in substantially similar 
manner to the generation of an internal read address. The output of the component block 
counter is used to compute a rotation offset 1347. The output rotate control signal 1304 
is given by the lowest two bits of the sum of the base read address 1354 and the rotation 

30 offset 1355. The input rotate control signal 1303 is simply given by the lowest two bytes 
of the external write addresses 1305 in this example of the address and rotate control 
signals generator. 
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Fig. 120 shows another example of the address generator 1292 for reassembling 
multiple channel pixel data from single component data stored in the recanfigurable MU V 
buffer 250. In this case, the buffer is operating in the JPEG (mode2) for JPEG 
decompression operation. In this case, single component data blocks are stored in the 
5 buffer, and pixel data blocks are retrieved from the buffer, In this example, the write 
address to the memory modules are provided by the external write address 1305 without 
substantial changes. The single component blocks are stored in contiguous memory 
locations. The input rotate control signal 1303 in this example is simply set to the 
lowest two bits of the write address. A pixel counter 1360 is used to keep track of the 

10 number of pixels extracted from the single component blocks stored in the buffer. The 
output of the pixel counter is used to generate the read addresses 1348, 1349, 1350 and 
1351, and the output rotate control signal 1304. The read addresses are in general 
different for each memory module that comprize the storage device 1293. In this example, 
a read address comprizes two parts, a single component block index 1362, 1363, 1364 or 

15 1365, and a byte index 1361. An oflset is added to bit 3 and 4 of the output of the pixel 
counter to calculate the single component block index for a particular block. Hie offsets 
1366, 1367, 1368 and 1369 are in general different for each read address. Bit 2 to bit 0 
of the output of the pixel counter are used as the byte index 1361 of a read address. A 
read address is the result of the concatenation of a single component block index 1362, 

20 1363, 1364 or 1365 and a byte index 1361, as illustrated in Fig. 120. In this example, 
the output rotate control signal 1304 is generated using bit 4 and bit 3 of the output of the 
pixel counter without substantial change. The increment signal 1308 is used as the pixel 
counter increment signal to increment the pixel counter 1360. The pixel counter 1360 is 
incremented after a pixel has been successfully retrieved from (he buffer. 

25 Fig. 121 illustrates an example of a structure of the storage device 1293. The 

storage device 1293 can comprize three 4-bit wide memory modules 1383, 1384 and 1385, 
and three 8-bit wide memory modules 1380, 1381 and 1382. The memory modules can 
be combined together to store 36-bit words in the single lookup table mode (modeO), 
3xl2-bit words in the multiple lookup table mode (model), and 32-bit pixels or 4x8-bit 

30 single component data in JPEG mode (mode2). Typically each memory module is 
associated with a different part of the encoded input and output data streams (1296 and 
1297). For example, memory module 1380 has its data input port connected to bit 0 to 
bit 7 of the encoded input data stream 1296, and its data output port connected to bit 0 to 
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bit 7 of the encoded output data stream 1297. Ia this example, the write addresses to all 
the memory modules are connected together, and share substantially the c arne value. In 
contrast, the read addresses 1386, 1387, 1388, 1389, 1390 and 1391 to the memory 
modules of the example illustrated in Fig. 121 are supplied by the read address generator 
5 1292, and are in general different In the example, a common write enable signal is used 
to provide the write enable signals to all three 8-bit memory modules, and a second 
coniison wn.A enal" 1 : ;>ig.. .1 is u; ; piovidc : 'e enable ?fgr *; ~ : v;. ' 

memory modules. 

Fig. 122 illustrates a schematic of a combinatorial circuit used for generating 

10 read addresses 1386, 1387, 1388, 1389, 1390 and 1391 for accessing to the memory 
modules contained in a storage device 1293. Each encoded input data object is broken 
up into parts, and each part is stored into a separate memory module in the storage device. 
Hence, typically the write addresses to all memory modules for all operating modes are 
substantially the same and thus substantially no logic is required to compute the write 

15 address to the memory modules. The read addresses in this example* on the other hand, 
are typically different for different operations, and are also different to each memory 
module within each operating mode. All bytes in the output data stream 1298 of the 
reconfigurable MUV buffer 250 must contain single component data extracted from the 
pixel data stored in the buffer in the JPEG mode (mode2) for JPEG compression, or pixel 

20 data extracted from the single component data blocks stored in the buffer in the JPEG 
mode for JPEG decomposition. The requirements on the output data stream are 
achieved by providing four read addresses 1348, 1349, 1350 and 1351 to the buffer. In the 
multiple lookup table mode (model), up to three lookup tables are stored in the buffer, 
and thus only up to three read addresses 1348, 1349 and 1350 are needed to index the 

25 three lookup tables. Hie read addresses to all memory modules are substantially the 
same in the single lookup table mode (modeO), and only read address 248 is used in this 
mode. The example controller circuit shown in Fig. 122 uses the operating mode signals 
to the buffer, and up to four read addresses, to compute the read address 1386 - 1391 to 
each of the six memory modules comprising the storage device 1293. The read address 

30 generator 1292 takes, as its inputs, the external read addresses 1299, which comprizes 
external address buses 1348, 1349, 1350 and 1351, and generates the internal read 
addresses 1386, 1387, 1388, 1389, 1390 and 1391 to the memory modules that comprize 
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the storage device 1293. No manipulation on the external write addresses 1305 is 
required in the operation of this example. 

Fig. 123 illustrates a representation of an example of how 20-bit matrix 
coefficients may be stored in the buffer 250 when the buffer 250 is operating in single 

5 lookup table mode (modeO). In this example, typically no encoding is applied on the 
data objects stored in the cache when the data objects are written into the reconfigurable 
MUV buffer. The matrix coefficients are stored in the 8-bit memory modules 1380, 
1381 and 1382. Bit 7 to bit 0 of the matrix coefficient are stored in memory module 
1380, bit 15 to bit 8 of the matrix coefficient are stored in memory module 1381, and bit 

10 19 to bit 16 of the matrix coefficient are stored in the lower 4 bits of memory module 
1382* The data objects stored in the buffer may be retrieved as many times as required 
for the rest of the instruction. The write and read addresses to all memory modules 
involved in the single lookup table mode are substantially the same. 

Fig. 124 illustrates a representation of how the table entries are stored in the 

15 buffer in the multiple lookup table mode (model). In this example, up to three lookup 
tables may bo stored in the buffer, and each lookup table entry comprizes a 4-bit interval 
value and an 8-bit fraction value. Typically the interval values are stored in the 4-bit 
memory modules, and the fraction values are stored in the 8-bit memory modules. The 
three lookup tables 1410, 1411 and 1412 are stored in the memory banks 1380 and 1383, 

20 1381 and 1384, 1382 and 1385 in the example. The separate write enable control signals 
1306 and 1307 (Fig, 121) allow the interval values to be written into the storage device 
1293 without affecting the fraction values already stored in the storage device. In 
substantially the same manna:, the fraction values may be written into storage device 
without affecting the interval values already stored in the storage device. 

23 Fig. 125 illustrates a representation of how pixel data is stored in the 

reconfigurable MUV buffer 250 when the JPEG mode (mode2) for decomposing pixel 
data blocks into single component data blocks. The storage device 1293 is organized as 
four 8-bit memory banks, which comprizes the memory modules 1380, 1381, 1382, 1383 
and 1384, with 1383 and 1384 used together to operate substantially in the same manner 



30 as an 8-bit memory module. Memory module 1385 is not used in the JPEG mode 
(mode2). A 32-bit encoded pixel is broken up into four bytes, and each is stored into a 
different 8-bit memory module. 
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Fig. 126 illustrates a representation of how the single component data blocks are 
stored in the storage device 1293 in single component mode. The storage device 1293 is 
organized as four 8-bit memory banks, which comprizes the memory modules 1380, 1381, 
1382, 1383 and 1384, with 1383 and 1384 used together to operate substantially in the 

5 same manner as an 8-bit memory module. A single component block in this example 
comprizes 64 bytes. A different amount of byte rotation can be applied to each single 
component block when it is written into the buffer. A 32-bit encoded pixel data is 
retrieved by reading from the different single component data block stored in the buffer. 

For further details on the organization of the data within the MUV buffer 250 

10 reference is made herein to the section entitled Pixel Organizer. 

This preferred embodiment has shown that a reconfigurable data buffer may be 
used to handle data involved in different instructions. A reconfigurable data buffer that 
provides three operating modes has been disclosed. Different address generation 
techniques may be needed in each operating mode of the buffer. The single look-up 

15 table mode (modeO) may be used to store matrix coefficients in the buffer for an image 
transformation operation. The multiple look-up table mode (model) may be used to 
store a multiplicity of interval and fraction lookup tables in the buffer in a multiple 
channel color space conversion (CSC) operation. The JPEG mode (mode2) may be used 
either to decompose MCU data into single component 8x8 blocks, or to reconstruct MCU 

20 data from single-component 8x8 blocks, in JPEG compression and decompression 
operation respectively. 
3.18.3 Result Organizer 

The MUV buffer 250 is also utilized by the result organizer 249. The result 
organizer 249 buffers and formats the data stream from either the main data path 242 or 

25 the JPEG coder 241. The result topauztt 249 also is responsible for data packing and 
unpacking, denormalization, bjte lane swapping and realignment of result data as 
previously discussed with reference to Fig. 42. Additionally the result organizer 249 
transmits its results to the external interface controller 238, the local memory controller 
236, and the peripheral interface controller 237 as required. 

30 When operating in JPEG decompression mode, the results organizer 249 utilizes 

the MUV RAM 250 to double buffer image data produced by the JPEG coder 241. 
Double buffering increases the performance of the JPEG decompression by allowing data 
from the JPEG coder 241 to be written to one half of the MUV RAM 250 while at the 
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same time image data presently in the other half of the MUV RAM 250 is output to a 
desired destination. 

Tbe 1, 3 and 4 channel image data is passed to the result organizer 249 during 
JPEG decompression in a form of 8 x 8 blocks with each block consisting of 8 bit 

5 components from iuc s.:me channel. The result organizer stores these bloc?.:; \n the 
MUV RAM 250 in the order provided and then, for multi-channel interleaved images, 
meshing of the channels in performed when reading data from the MUV RAM 250. For 
example, in a three channel JPEG compression based on Y, U, V color space, the JPEG 
coder 241 outputs three 8x8 blocks, the first consisting of Y components, the second 

10 made of the U components and the third made up of the V components. Meshing is 
accomplished by taking one component from each block and constructing the pixel in the 
form of (YU VX) where X represents an unused channel. Byte swapping may be applied 
to each output to swap the channels as desired. The result organizer 249 must also do 
any required sub-sampling to reconstruct chroma-data from decompressed output. This 

15 can involve replicating each program channel to produce and an one. 

Turning to Fig. 127, there is illustrated the result organizer 249 of Fig, 2 in more 
detail. The result organizer 249 is based around the usual standard CBus interface 840 
which includes a register file of registers to be set for operation of the result organizer 249. 
The operation of the result organizer 249 is similar to that of the pixel organizer 246, 

20 however the reverse data manipulation operations take place. A data manipulation unit 
842 performs byte lane swapping, component substitution, component deselection and 
denormalization operations on data provided by tbe MUV address generator (MAG) 805. 
The operations carried out are those previously described with reference to Fig. 42 and 
operate in accordance with various fields set in internal registers. The FIFO queue 843 

25 provides buffering of output data before it is output via RBus control unit 844. 

The RBus control unit 844 is composed of an address decoder and state 
machines for address generation. The address for the destination module is stored in an 
internal register in addition to data on the number of output bytes required. Further, an 
internal RO_CUT register specifies how many output bytes to discard before sending a 

30 byte stream on the output bus. Additionally, a RO_LMT register specifies the maximum 
number of data items to be output with subsequent data bytes after the output limit being 
ignored. The MAG 805 generates addresses for the MUV RAM 250 during JPEG 
decompression. The MUV RAM 250 is utilized to double buffer output from the JPEG 
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decoder. The MAG 805 performs any appropriate meshing of components in the MUV 
RAM 250 in accordance with an internal configuration register and outputs single channel, 
three channel or four channel interleaved pixels. The data obtained from the MUV 
RAM 250 is then passed through the data manipulation unit 842, since byte lane swapping 

5 may need to be appi*"ad before pixel data is sent to the appropriate destination. When the 
results organizer 249 is not configured for JPEG mode, the MAG 805 simply forwards 
data from the PBus receiver 845 straight through to the data manipulation unit 842. 
3.18.4 Operand Organizers B and C 

Returning again to Fig. 2, the two identical operand organizers 247, 248 perform 

10 the function of buffering data from the data cache control 240 and forwarding the data to 
the JPEG coder 241 or the main data path 242. The operand organizers 247, 248 are 
operated in a number of modes: 

(a) Idle mode wherein the operand organizer only responds to CBus requests. 

(b) Immediate mode when the data of the current instruction is stored in an 
1 5 internal register of the operand organizer. 

(c) Sequential mode wherein the operator organizer generates sequential 
addresses and requests data from the data cache controller 240 whenever its input buffer 
requires filling. 

A number of modes of operation of the main data path 242 require at least one of 
20 the operand organizers 247, 248 to operate in sequential mode. These modes include 
compositing wherein operand organizer B 247 is required to buffer pixels which are to be 
composited with another image. Operand organizer C 248 is used for compositing 
operations for attenuation of values for each data channel. In halftoning mode, operand 
organizer B 247 buffers 8 bit matrix co-efficients and in hierarchial image format 
25 decompression mode the operand organizer B 247 buffers data for both vertical 
interpolation and residual merging instructions. 

(d) In constant mode, an operand organizer B constructs a single internal data 
word and replicates (his word a number of times as given by an internal register. 

(e) In tiling mode an operand organizer B buffers data that comprizes a pixel 

30 tile. 



(f) In random mode the operand organizer forwards addresses from the MDP 
242 or JPEG coder 241 directly to the data cache controller. These addresses are utilized 
to index the data cache 230, • > 
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An internal length register specifies the number of items to he generated by 
individual operand organizers 247, 248 when operated in sequential/titling/constant mode. 
Each operand organizer 247, 248 keeps account of the number of data items processed so 
far and stops when the count reaches the value specified in its internal register. Each 

5 operand organizer is further responsible for formatting input data via byte lane swapping, 
component substitution, packed/unpacked and normalization functions. The desired 
operations are configured utilising internal registers. Further, each operand organizer 
247, 248 may also be configured to constrict data items. 

Turning now to Fig, 128, there is illustrated the structure of operand organizers 

10 (247, 248) in more detail. The operand organizer 247, 248 includes the usual standard 
CBus interface and registers 850 responsible for the overall control of the operand 
organizer. Further, an OBus control unit 851 is provided for connection to the data 
cache controller 240 and is responsible for performing address generation for 
sequential/tile/ constant modes, generating control signals to enable communications on 

15 the OBus interface to each operand organizer 247, 248 and controlling data manipulation 
unit operations such as normalization and replication, that require the state to be saved 
from previous clock cycles of the input stream. When an operand organizer 247, 248 is 
operating in sequential or tiling mode, the OBus control unit 851 sends requests for data 
to the data cache controller 240, the addresses being determined by internal registers. 

20 Each operand organizer further contains a 36 bit wide FIFO buffer 852 used to 

buffer data from the data cache controller 240 in various modes of operation. 

A data manipulation unit 853 performs the same functions as the corresponding 
data manipulation unit 804 of the pixel organizer 246. 

A main data path/JPEG coder interface 854 multiplexer address and data to and 

25 from the main data path and JPEG coder modules 242, 241 in normal operating mode. 
The MDP/JC interface 854 passes input data from the data manipulation units 853 to the 
main data path and in the process may be configured to replicate this data. When 
operating in color conversion mode, the units 851, 854 are bypassed in order to ensure 
high speed access to the data cache controller 240 and the color conversion tables. 

30 3.18.5 Main Data Path Unit 

The aspects of the following embodiment relate to an image processor providing a 
low cost computer architecture capable of performing a number of image processing 
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computer architecture capable of being configured to perform image processing 
operations that arc not originally specified The image processor also seeks to provide a 
computer architecture having a large amount of identical logic, which simplifies the 
design process and lowers the cost of designing such an architecture. 

5 The computer architecture comprises a control register block, a decoding block, a 

data object processor, and flow control logic. The control register block stores all the 
relevant information ;jbout the image ;~r?ccssin£ operation, The decod'ng block decodes 
the information into configuration signals, which configure an input data object interface. 
The input data object interface accepts and stores data objects from outside, and 

10 distributes these data objects to the data object processor. For some image processing 
operations, the input data object interface may also generate addresses for data objects, so 
that the source of these data objects can provide the correct data objects. The data object 
processor performs arithmetic operations on the data objects received. The flow control 
logic controls the flow of data objects within the data object processing logic. 

15 More particularly, the data object processor can comprise a number of identical 

data object sub-processors, each of which processes part of an incoming data object. 
The data object sub-processor includes a number of identical multifunctional arithmetic 
units that perform arithmetic operations on these parts of data objects, post processing 
logic that processes the outgoing data objects, and multiplexer logic that connects the 

20 multifunctional arithmetic units and the post-processing unit together. The 
multifunctional arithmetic units contain storage for parts of the calculated data objects. 
The storage is enabled or disabled by the flow control logic- The multifunctional 
arithmetic units and multiplexer logic arc configured by the configuration signals 
generated by the decoding logic 

25 Furthermore, the configuration signals from the decoding logic can be overridden 

by an external programming agent Through this mechanism any multifunctional blocks 
and multiplexer logic can be individually configured by an external programming agent, 
allowing it to configure the image processor to perform image processing operations that 
are not specified beforehand. These and other aspects of the embodiments of the 

30 invention are described in greater detail hereinafter. 
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Returning to Fig. 2, as noted previously the main data path unit 242 performs all 
data manipulation operations and instructions other than JPEG data coding. These 
instructions include compositing color space conversion, image transformations, 
convolution, matrix multiplication, halftoning, memory copying and hierarchial image 

5 format decompression. The main data path 242 receives pixel and operand data from the 
pixel organizer 246, and operand organizers 247, 248 and feeds the resultant output to the 
result organizer 249. 

Fig.129 illustrates a block diagram of the main data path unit 242. The main 
data path unit 242 is a general image processor and includes input interface 1460, image 

10 data processor 1462, instruction word register 1464, instruction word decoder 1468, 
control signal register 1470, register file 1472, and a ROM 1475. 

The instruction controller 235 transfers instruction words to the instruction word 
register 1464 via bus 1454. Each instruction word contains information such as the kind 
of image processing operation to be executed, and flags to enable or disable various 

15 options in that image processing operation. The instruction word is then transferred to 
the instruction word decoder 1468 via bus 1465. Instruction controller 235 can then 
indicate to the instruction word decoder 1468 to decode the instruction word. Upon 
receiving that indication, the instruction decoder 1468 decodes the instruction word into 
contrul signals. Hiese control signals arc then transferred via bus 1469 to the control 

20 signal register 1470. The output of the control signal register is then connected to the 
input interface 1460 and image data processor 1462 via bus 1471. 

To add further flexibility to the main data path unit 242, the instruction controller 
235 can also write into the control signal register 1470. This allows anyone who is 
familiar with the structure of the main data path unit 242 to micro-configure the main data 

25 path unit 242 so that the main data path unit 242 will execute image processing operations 
that are not be described by any instruction word. 

In cases when all the necessary information to perform the desired image 
processing operation does not fit into the instruction word, the instruction controller 235 
can write all the other information necessary to perform the desired image processing 

30 operation into some of the selected registers in register file 1472, The information is 
then transferred to the input interface 1460 and the image data processor 1462 via bus 
1473. For some image processing operations, the input interface 1460 may update the 
contents of selected registers in the register file 1472 to reflect the current status of the 
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main data path unit 242. This feature helps the instruction controller 235 to fir.:* out 
what the problem is when there is a problem in executing an image processing open*., i. 

Once the decoding of the instruction word is finished, and/or the control signal 
register is loaded with the desired control signals, the instruction controller 235 can 

5 indicate to the main data p?th unit 242 to start performing the desired image pjoco^ns 
operation. Once that indication is received, the input interface 1460 begins to accept 

&i .bjects ccniag frou> bus 1451. De;^.-J;n^ c\ : Yr* ' r : : m.:;r- - . 

operation performed, the input interface 1460 may also begins to accept operand data 
coming from operand bus 1452 and/or operand bus 1453, or generates addresses for 

10 operand data and receive operand data from operand bus 1452 and/or operand bus 1453. 
The input interface 1460 then stores and rearranges the incoming data in accordance with 
the output of the control signal register 1470. The input interface 1460 also generates 
coordinates to be fetched via buses 1452 and 1453 when calculating such functions as 
affine image transformation operations and convolution. 

15 The image data processor 1462 performs the major arithmetic operations on the 

rearranged data objects from the input interface 1460. The image processor 1462 can: 
interpolate between two data objects with a provided interpolation factor; multiply two 
data objects and divide the product by 255; multiply and add two data objects in general; 
round off fraction parts of a data object which may have various resolutions; clamp 

20 overflow of a data object to some maximum value and underflow of a data object to some 
minimum value; and perform scaling and clamping on a data object. The control signals 
on bus 1471 control which of the above arithmetic operations are performed on the data 
objects, and the order of the operations. 

A ROM 1475 contains the dividends of 255/x, where x is from 0 to 255, rounded 

25 in 8.8 format Hie ROM 1475 is connected to the input interface 1460 and the image data 
processor 1462 via bus 1476. The ROM 1475 is used to generate blends of short lengths 
and multiply one data object by 255 and dividing the product by another data object 

Preferably, the number of operand buses eg 1452 is limited to 2, which is 
sufficient for most image processing operations. 

30 Fig. 130 illustrates the input interface 1460 in further detail. Input interface 1460 

includes data object interface unit 1480, operand interface units 1482 and 1484, address 
generation state machine 1486, blend generation state machine 1488, matrix 
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multiplication state machine 1490, inteipolation state machine 1490, data synchronizer 
1500, arithmetic unit 1496, miscellaneous register 1498, and data distribution logic 1505. 

Data object interface unit 1480 and operand interface units 1482 and 1484 are 
responsible to receive data objects and operands from outside. These interface units 

5 1482, 1484 are all configured by control signals from control bus 1515. These interface 
units 1482, 1484 have data registers within them to contain the data objects/operands that 
they have just received, and they all produce a VALID signal which is asserted when Lhe 
data within the data register is valid. The outputs of the data registers in these interface 
units 1482, 1484 are connected to data bus 1505. The VALID signals of these interface 

10 units 1482, 1484 are connected to flow bus 1510. When configured to fetch operands, 
operand interface units 1482 and 1484 accept addresses from arithmetic unit 1496, matrix 
multiplication state machine 1490 and/or the output of data register in data object 
interface unit 1480, and select amongst them the required address in accordance with the 
control signals from control bus 1515. In some cases, the data registers in operand 

15 interface units 1482 and 1484 can be configured to store data from the output of data 
register in data object interface unit 1480 or arithmetic unit 1496, especially when they are 
not needed to accept and store data from outside. 

Address generation state machine 1486 is responsible for controlling arithmetic 
unit 1496 so that it calculates the next coordinates to be accessed in the source image in 

20 affine image transformation operations and convolution operations. 

The address generation state machine 1486 waits for START signal on control 
bus 1515 to be set. When the START signal on control bus 1515 is set, address 
generation state machine 1486 then de-asserts the STALL signal to data object interface 
unit 1480, and waits for data objects to arrive. It also sets a counter to be the number of 

25 data objects in a kernel descriptor that address generation state machine 1486 needs to 
fetch, nac output of the counter is decoded to become enable signals for data registers in 
operand interface units 1482 and 1484 and miscellaneous register 1498. When the 
VALID signal from data object interface unit 1480 is asserted, address generation state 
machine 1486 decrements the counter, so the next piece of data object is latched into a 

30 different register. 



When the counter reaches zero, address generation state machine 1486 tells 
operand interface unit 1482 to start fetching index table values and pixels from operand 
interface unit 1484. Also, it loads two counters, one with the number of rows, another 
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with the number of columns. At every clock edge, when it is not paused by STALL 
signals from the operand interface unit 1482 or others, the counters are decremented to 
give the remaining rows and columns, and the arithmetic unit 1496 calculates the next 
coordinates to be fetched from. When both counters have reached zero, the counters 
5 r^u^d themselves wim the numocr r.-. and columns ag.^n, and arithmetic unit 1496 
is configured to find the top left hand corner of the next matrix. 

If interpolation is used to determine the true value of a pixel, address generation 
state machine 1486 decrements the number of rows and columns after every second clock 
cycle. This is implemented using a 1-bit counter, with the output used as the enable of 
10 the row and column counter. After the matrix is traversed around once, the state 
machine sends a signal to decrement the count in the length counter. When the counter 
reaches 1, and the final index table address is sent to the operand interface unit 1482* the 
state machine asserts a final signal, and resets the start bit 

Blend generation state machine 1488 is responsible for controlling arithmetic 
15 unit 1496 to generate a sequence of numbers from 0 to 255 for the length of a blend. 
This sequence of numbers is then used as the interpolation factor to interpolate between 
the blend start value and blend end value. 

Blend generation state machine 1488 determines which mode it should run in 
(jump mode or step mode). If the blend length is less than or equal to 256, then jump 
20 mode is used, otherwize step mode is used. 

The blend generation state machine 1488 calculates the following and puts them 
in registers (regO, rcgl, reg2). If a blend ramp is in step mode for a predetermined length, 
then latch 511-length in regO (24 bits), 512 - 2*length in reg 1 (24 bits), and end-start in 
reg 2 (4*9 bits). If the ramp is in jump mode, then latch 0 into regO, 255/(lengtb-l) into 
25 regl, and end-start into reg2 (4x9 bits). 

In step mode, the following operations are performed for every cycle: 
If regO > 0, then add regO with reg 1 and store the result in regO. Another 
incrcmentor can also be enabled so its output is incremented by 1. If regO <= 0, then add 
regO with 510 and store the result in regO. Ihcrementor is not incremented. Tlie output 
30 of the incrementor is the ramp value. 

In jump mode, the following is done for every cycle: 
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Add regO with regl. The Adder output is 24 bits, in fixed point format of 16.8. Store 
the adder output in regO« If the first bit of fraction result is 1, then increment the integer 
part. 

The least 8 bits of the integer part of the incrementor is the ramp value. The 
5 ramp value, the output of reg2, and the blend start value is then fed into the image data 
processor 1462 to produce the ramp. 

Matrix multiplication state machine 1490 is responsible for performing linear 
color space conversion on input data objects using a conversion matrix. The conversion 
matrix is of the dimension 4X5. The first four columns multiply with the 4 channels in 
10 the data object* while the last column contains constant coefficients to be added to the 
sum of products. When the START signal from control bus 1515 is asserted, matrix 
multiplication state machine does the following: 

1) It generates line numbers to fetch constant coefficients of the conversion 
matrix from buses 1482 and 1484. It also enables miscellaneous register 1498 to store 

15 these constant coefficients. 

2) It contains a 1-bit flipflop, which generates a line number which is used 
as an address to fetch half of matrix from buses 1482 and 1484. It also generates a 
"MAT_SEL" signal that selects which half of the data object to be multiplied with that 
half of matrix. 

20 3) It finishes when there is no data objects coming from data object 

interface unit 1480. 

Interpolation state machine 1494 is responsible for performing horizontal 
interpolation of data objects. During horizontal interpolation, main data path unit 242 
accepts a stream of data objects from bus 1451, and interpolates between adjacent data 
25 objects to output a stream of data objects which is twice or 4 times as long as the original 
stream. Since the data objects can be packed bytes or pixels, interpolation state machine 
1494 operates differently in each case to maximize the throughput. Interpolation state 
machine 1494 does the following: 

1) It generates INT_SEL signal to data distribution logic 1503 to rearrange 
30 the incoming data objects so that the right pair of data objects are interpolated. 

2) It generates interpolation factors to interpolate between adjacent pairs of 
data objects. 
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3) Ii generates a STALL signal to stop data object interface unit 1480 from 
accepting more data objects. This is necessary as the output stream is longer than the 
input stream. The STALL signal goes to flow bus 1510. 

Arithmetic unit 1496 contains circuitry for performing arithmetic calculations. 

5 II r configured by control sign ^Js on control bus 1515. si is used by two instructions 
only: affine image transformation and convolution, and blend generation in compositing. 

In affine imagp transformation and convolution, arithmetic unit 1496 is 
responsible fon 

1) Calculating the next x and y coordinates. To calculate x coordinates 
10 arithmetic unit 1496 uses an adder/subtractor to add/subtract the x part of horizontal and 

vertical delta to/from the current x coordinate. To calculate the y coordinates arithmetic 
unit 1498 uses an addcr/subtractor to add/subtract the y part of the horizontal or vertical 
delta to/from the current y coordinate. 

2) Adding the y coordinate to the index table offset to calculate the index 
15 tabic address. This sum is also incremented by 4 to find the next index table entry, when 

interpolation is used to find true value of a pixel. 

3) Adding the x coordinate to the index table entry to find the address of 
the pixel. 

4) Subtract 1 from the length count. 

20 In blend generation, arithmetic unit 1496 does the following: 

1) In step mode, one of the ramp adders is used to calculate an internal 
variable in the ramp generation algorithm, while the other adder is used to increment the 
ramp value when the internal variable is greater than 0. 

2) In jump mode, only one of the adders is required to add the jump value 
23 to the current ramp value. 

3) Round feacdoris oc^ 

4) Subtract start of blend from end of blend at the beginning of ramp 
generation. 

5) Subtract one from the length count 

30 Miscellaneous register 1498 provides extra storage space apart from the data 

registers in data object interface unit 1480 and operand interface units 1482 and 1484. It 
is usually used to store internal variables or as a buffer of past data objects from data 
object interface unit 1480, It is configured by control signals on control bus 1515. 
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Data synchronizer 1500 is configured by control signals on control bus 1515. It 
provides STALL signals to data object interface unit 1480 and operand interface units 
1482 and 1484 so thai if one of the interface units receives a piece of data object others 
have not, that interface unit is stalled until all the other interface units have received their 
5 pieces of data. 

Data distribution logic 1505 rearranges data objects from data bus 1510 and 
register file 1472 via bus 1530 in accordance with control signals on cc.itrcl bus 1515, 
including a MAT_SEL signal from matrix multiplication state machine 1490 and a 
INT.SEL signal from interpolation state machine 1494. The rearranged data is outputed 
10 onto bus 146L 

Fig. 131 illustrates image data processor 1462 of Fig. 129 in further detail. Image 
data processor 1462 includes a pipeline controller 1540, and a number of color channel 
processors 1545, 1550, 1555 and 1560. All color channel processors accept inputs from 
bus 1565, which is driven by the input interface 1460 (Fig. 131). All color channel 

15 processors and pipeline controller 1540 are configured hy control signals from control 
signal register 1470 via bus 1472. All the color channel processors also accept inputs 
from register file 1472 and ROM 1475 of Fig. 129 via bus 1580. The outputs of all the 
color channel processors and pipeline controller are grouped together to form bus 1570, 
which forms the output 1455 of image data processor 1462. 

20 Pipeline controller 1540 controls the flow of data objects within all the color 

channel processors by enabling and disabling registers within all the color channel 
processors. Within pipeline controller 1540 there is a pipeline of registers. The shape 
and depth of the pipeline is configured by the control signals from bus 1471, and the 
pipeline in pipeline controller 1540 has the same shape as the pipeline in the color 

25 channel processors. The Pipeline controller accepts VALID signals from bus 1565. 
For each pipeline stage within pipeline controller 1540, if the incoming VALID signal is 
asserted and the pipeline stage is not stalled, then the pipeline stage asserts the register 
enable signals to all color channel processors, and latch the incoming VALID signal. 
Hie output of the latch then a VALID signal going to the next pipeline stage. In this way 

30 the movement of data objects in the pipeline is simulated and controlled, without storage 
of any data. 

Color channel processors 1545, 1550, 1555 and 1560 perform the main 
arithmetic operations on incoming data objects, with each of them responsible for one of 
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the channels of the output data object. In the preferred embodiment the number of color 
channel processors is limited to 4, since most pixel data objects have a maximum of 4 
channels. 

One of the color channel processors processes the opacity channel of a pixel 
5 There is additional circuitry (not shown in Fig. 131), connected to the control bus 1471, 
which transforms the control signals from the control bus 1471 so that the color channel 
processor processes the opacity channel correctly, as for some imago y ocessing 
operations the operations on the opacity channel is slightly different from the operations 
on the color channels. 

10 Fig. 132 illustrates color channel processor 1545, 1550, 1555 or 1560 (generally 

denoted by 1600 in Fig. 132) in further detail. Each color channel processor 1545, 1550, 
1555 or 1560 includes processing block A 1610, processing block B 1615, big adder 1620, 
fraction rounder 1625, clamp-or-wrapper 1630, and output multiplexer 1635. The color 
channel processor 1600 accepts control signals from control signal register 1470 via bus 

is 1602, enable signals from pipeline controller 1540 via bus 1604, information from 
register file 1472 via bus 1605, data objects from other color channel processor via bus 
1603, and data objects from input interface 1460 via bus 1601. 

Processing block A 1610 performs some arithmetic operations on the data 
objects from bus 1601, and produces partially computed data objects on bus 1611. The 

20 following illustrates what processing block A 1610 does for designated image processing 
operations. 

In compositing, processing block A 1610 pre-multiplies data objects from data 
object bus 1451 with opacity, interpolates between a blend start value and a blend end 
value with an interpolation factor from input interface 1460 in Fig. 129, prc-multiplies 
25 operands from operand bus 1452 in Fig. 129 or multiplies blend color by opacity, and 
attenuates multiplication on pre-multiplied operand or blend color data. 

In general color space conversion, the processing block A 1610 interpolates 
between 4 color table values using two fraction values from bus 1451 in Fig. 129. 

In affine image transformation and convolution, the processing block A 1610 
30 pre-multiplies the color of the source pixel by opacity, and interpolates between pixels on 
the same row using the fraction part of current ^-coordinate. 
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In linear color space conversion, the processing block A 1610 pre-multiplies 
color of the source pixel by opacity, and multiplies pie-multiplied color data with 
conversion matrix coefficients. 

In horizontal interpolation and vertical interpolation, the processing block A 
5 1610 interpolates between two data objects. 

In residual merging, the processing block A 1610 adds two data objects. 

Processing block A 1610 includes a number of multifunction blocks 1640 and 
processing block A glue logic 1645. The multifunction blocks 1640 are configured by 
control signals, and may perform any one of the following functions: 
10 add/subtract two data objects; 

passing one data object; 

interpolate between two data objects with a interpolation factor; 
prc-multiply a color with an opacity; 

multiply two data objects, and then add a third data object to the product; and 
15 add/subtract two data objects, and then pre-multiply the sum/difference with an 

opacity. 

The registers within the multifunction blocks 1640 are enabled or disabled by 
enable signals from bus 1604 generated by pipelined controller 1540 in Fig. 131. 
Processing block A glue logic 1645 accepts data objects from bus 1601 and data objects 
20 from bus 1603, and the outputs of some of the multifunction blocks 1640, and routes them 
to inputs of other selected multifunction blocks 1640. Processing block A glue logic 
1645 is also configured by control signals from bus 1602. 

Processing block B 1615 performs arithmetic operations on the data objects from 
bus 1601, and partially computed data objects from bus 1611, to produce partially 
23 computed data objects on bus 1616. The following description illustrates what 
processing block B 1615 does for designated image processing operations. 

In compositing (with non-plus operators), the processing block B 1615 multiplies 
pre-processed data objects from data object bus 1451 and operands from operand bus 
1452 with compositing multiplicands from bus 1603, and multiplies clamped/wrapped 
30 data objects by output of the ROM, which is 255/opacity in 8.8 format. 

In compositing with plus operator, the processing block B 1615 adds two pre- 
processed data objects. In the opacity channel, it also subtracts 255 from the sum, 
multiplies an offset with the difference, and divides the product by 255. 
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In general color space conversion, the processing block B 1615 interpolates 
between 4 color table values using 2 of the fraction values from bus 1451, anu 
interpolates between partially interpolated color value from processing block A 1610 and 
the result of the previous interpolation using the remaining fraction value. 
5 In affine image transformation and convolution, the processing block B 1615 

interpolates between partially interpolated pixels using the fraction part of current y- 
coordlnate. and multiplies interpolated pixels with coefficients in ?- s**^- sample weight 
matrix. 

In linear color space conversion, the processing block B 1615 pTe-mu!tiplies the 
10 color of the source pixel by opacity, and multiplies pre-multiplied color with conversion 
matrix coefficients. 

Processing block B 1615 again includes a number of multifunction blocks and 
processing block B glue logic 1650. The multifunction blocks are exactly the same as 
those in processing block A 1610, but the processing block B glue logic 1650 accepts data 
15 objects from buses 1601, 1603, 1611, 1631 and the outputs of selected multifunction 
blocks and routes them to the inputs of selected multifunction blocks. Processing block 
B glue logic 1650 is also configured by control signals from bus 1602. 

Big adder 1620 is responsible for combining some of the partial results from 
processing block A 1610 and processing block B 1615. It accepts inputs from input 
20 interface 1460 via bus 1601, processing block A 1610 via bus 1611, processing block B 
1615 via bus 1616, and register file 1472 via bus 1605, and it produces the combined 
result on has 1 621. It is also configured by control signals on bus 1602. 

For various image processing operations, big adder 1620 may be configured 
differently. The following description illustrates its operation during designated image 
25 processing operations. 

In compositing with non-plus operators, the big adder 1620 adds two partial 
products from processing block B 1615 together. 

In compositing with plus operator, the big adder 1620 subtracts the sum of pre- 
processed data objects with offset from the opacity channel, if an offset enable is on. 
30 In affine image transformation/convolutiqn, the big adder 1620 accumulates the 

products from processing block B 1615 



In linear color space conversion, in the first cycle, the big adder adds the two 
matrix coefficients/data object products and the constant coefficient together, m the 
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second cycle, it adds the sum of last cycle with another two matrix coefficients/data object 
products together. 

Fraction rounder 1625 accepts input from the big adder 1620 via bus 1621 and 
rounds off the fraction part of the output. The number of bits representing the fraction 
5 part is described by a BP signal on bus 1605 from register file 1472. The following table 
shows how the BP signal is interpreted. The rounded output is provided on bus 1626. 



Table 27: Fraction Table 



bp field 


Meaning 


0 


Bottom 26 bits are fractions. 


1 


Bottom 24 bits are fractions. 


2 


Bottom 22 bits are fractions. 


3 


Bottom 20 bits are fractions. j 


4 


Bottom 18 bits are fractions. 


5 


Bottom 16 bits are fractions. 


6 


Bottom 14 bits are fractions. 


7 


Bottom 12 bits arc fractions. 



10 As well as rounding off fraction, fraction rounder 1625 also does two things: 

1) determines whether the rounded result is negative; and 

2) determines whether the absolute value of the rounded result is greater than 

255. 

Clamp-or-wrapper 1630 accepts inputs from fraction rounder 1625 via bus 1626 
15 and does the following in the order described: 

finds the absolute value of the rounded result, if such option is enabled; and 
clamps any underflow of the data object to the minimum value of the data object, 
and any overflow of the data object to the maximum value of the data object. 

Output multiplexer 1635 selects the final output from the output of processing 
20 block B on bus 1616 and the output of clamp-or-wrapper on bus 1631. It also performs 
some final processing on the data object. The following description illustrates its 
operation for designated image processing operations. 
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In compositing with non-plus operators and un-pre-multiplication, the 
multiplexer 1635 combines some of the outputs of processing block B 1615 to form the 
un-pre-multiplied data object. 

In compositing with non-plus operator and no un-pre-multiplication, the 
5 multiplexer 1635 passes on the output of cIarnp-e:-wTapper 1630. 

In compositing with plus operator, the multiplexer 1635 combines some of the 
outputs of processing block B 1630 to form resultant data object. 

In general color space conversion, the multiplexer 1635 applies the translate-and- 
clamp function on the output data object. 
10 In other operations, the multiplexer 1635 passes on the output of clamp-or- 

wrapper 1630. 

Fig. 133 illustrates a single multifunction block (e.g. 1640) in further detail. 
Multifunction block 1640 includes mode detector 1710, two addition operand logic units 
1660 and 1670, 3 multiplexing logic units 1680, 1685 and 1690, a 2-input adder 1675, a 
15 2-input multiplier with 2 addends 1695, and register 1705. 

Mode detector 1710 accepts one input from control signal register 1470, in Fig. 
129 the MODE signal 1711, and two inputs from input interface 1460, in Fig. 129 SUB 
signal 1712 and SWAP signal 1713. Mode detector 1710 decodes these signals into 
control signals going to addition operand logic units 1660 and 1670, and multiplexing 
20 logic units 1680, 1685 and 1690, and these control signals configure multifunction block 
1640 to perform various operations. There are 8 modes in multifunction block 1640: 

1) Add/sub mode: adds or subtract input 1655 to/from input 1665, in 
accordance with the SUB signal 1712. Also, the inputs can be swapped in accordance 
with the SWAP signal 693. 
25 2) Bypass mode: bypass input 1655 to output. 

3) Interpolate mode: interpolates between inputs 1655 and 1665 using input 
1675 as the interpolation factor. Inputs 1655 and 1665 can be swapped in accordance 
with the SWAP signal 1713. 

4) Pre-multiply mode: multiplies input 1655 with input 1675 and divide it by 
30 255, The output of the INC register 1708 tells the next stage whether to increment the 

result of this stage in bus 1707 to obtain the correct result. 

5) Multiply mode: multiplies input 1655 with 1675. 
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6) Add/subtract-and-pre-multiply mode; adds/subtracts input 1665 to/from 
input 1655, multiplies the sum/difference with input 1675. and then divide the product by 
255. The output of the INC register 1708 tells the next stage whether to increment the 
result of this stage in bus 1707 to obtain the correct result. 
5 Addition operand logic units 1660 and 1670 find one's complement of the input 

on demand, so that the adder can do subtraction as well. Adder 1675 adds the outputs of 
addition operand logic 1660 and 1670 in buses 1662 and 1672 together, and outputs the 
sum in bus 1677. 

Multiplexing logic 1680, 1685 and 1690 select suitable multiplicands and 
10 addends to implement, a desired function. They are all configured by control signals on 
bus 1714 from mode detector 1710. 

Multiplier with two addends 1695 multiplies input from bus 1677 with input 
from bus 1682, then adds the products to the sum of inputs from buses 1687 and 1692. 

Adder 1700 adds the least significant 8 bits of the output of multiplier 1695 with 
15 the most significant 8 bits of the output of multiplier 1695. Hie carryout of adder 1700 
is latched in INC register 1701. INC register 1701 is enabled by signal 1702. Register 
1705 stores the product from multiplier 1695. It is also enabled by signal 1702. 

Fig. 134 illustrates a block diagram for the compositing operations. The 
compositing operation accepts three input streams of data: 
20 1) The accumulated pixel data, which is derived from the same location as the 

result is stored to in this accumulator model. 

2) A compositing operand • which consists of color and opacity. Hie color 
and opacity can both be either flat, a blend, pixels or tiled. 

3) Attenuation * which attenuates the operand data. The attenuation can be 
25 flat, a bit map or a byte map. 

Pixel data typically consists of four channels. Three of these channels make up 
the color of the pixel. The remaining channel is the opacity of the pixel. Pixel data can 
be prc-multiplied or normal. When pixel data is pre-multiplied, each of the color 
channels are multiplied with the opacity. Since equations for oompositing operators are 
30 simple with pre-multiplied pixels, usually pixel data is pre-multiplied before it is 
composited with another pixel. 

The compositing operators implemented in the preferred embodiments are shown 
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multiplied pixel of color a c and opacity a 0 , r is the "offset" value and wc() is the 
wrapping/clamping operator the reverse operator of each of the over, in, out, atop 
operators in Table 1 is also implemented, and the compositing model has the accumulator 
on the left. 

5 Composite block 1760 in Fig. 134 comprises three color sub-blocks and a 

opacity sub-block. Each color sub-block operates on one color channel, and opacity 
channel ol the input pixels to obtain the color of the output pixel. The following psoido 
code shows how this is done. 

10 PIXEL Composite( IN colorA, colorB: PIXEL; 

IN opacity A, opacityB: PIXEL; 

IN comp_op: COMPOS ITE_OPERATOR 

) 

( 

15 PIXEL result; 

IF comp_pp is rover, rin, rout, ratop THEN 
swap colorA and colorB; 
swap opacityA and opacityB; 

END IF; 

20 IF comp-op is over or rover or loado or plus THEN 

X = l; 

ELSE IF comp_pp is in or rin or atop or ratop THEN 

X = opacityB; 
ELSE IF comp-op is out or rout or xor THEN 
25 X = not(opacityB); 

ELSE IF comp-op is loadzeio or loadc or loadco THEN 

X = 0 

END IF; 

IF comp-op is over or rover or atop or ratop or xor THEN 
30 Y = not(opacitya); 

ELSE IF comp_op is plus or loadc or loadco THEN 
Y = not(opacitya); 
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ELSE IF comp_pp is plus or loadc or loadco THEN 
Y = l; 

ELSE IF comp-op is in or rin or out or rout or 
loadzero or Loado THEN 
Y-0 

END IF; 

result = colorA * X + colorB * Y; 
RETURN result; 



10 The above pseudo code is different for the opacity sub-block, since the operators 

Hoade* and 'loado' have different meaning in the opacity channel. 

Block 1765 in Fig. 134 is responsible for clamping or wrapping the output of 
block 1760. When block 1765 is configured to clamp, it forces all values less than the 
minimum allowable value to the minimum allowed value, and all values more than the 
15 maximum allowed value to the maximum allowed value. If block 1765 is configured to 
wrap, it calculates the following equation: 

((x-min) mod (max-inin)) + min, 
whereby min and max are the minimum and maximum allowed value of the 
color respectively. Preferably the minimum value for a color is 0, and the maximum 
20 value is 255. 

Block 1770 in Fig, 134 is responsible for un-pre-multiplying the result from 
block 1765. It un-pre-multiplies a pixel by multiplying the pre-multiplied color value 
with 255/o s where o is the opacity after composition. The value 255/o is obtained from a 
ROM inside the compositing engine. The value stored in the ROM is in the format of 
25 8.8 and the rest of the fraction is rounded. The result of multiplication is stored in the 
format of 16.8. The result would be rounded to 8 bits to produce the un-pre-multiplied 
pixel. 

Blend generator 1721 generates a blend of a specified length with specified start 
and end values. Blend generation is done in two stages: 
30 1) ramp generation, and 

2) interpolation 



In ramp generation, the compositing engine generates a linearly increasing 
number sequence from 0 to 255 over the length of the instruction. There are two modes 
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in ramp generation: the "jump" mode, when the length is less than or equal to 255, and 
the "step" mode when the length is greater than 255. The mode is determined by 
examining the 24 most significant bits of the length. In the jump mode, the ramp value 
increases by at least one in every clock period. In the step mode, the ramp value 
5 increases by at most one in every clock period. 

In the jump mode, the compositing engine uses the ROM to find out the step 
value 255/(length-l), in 8.8 format. This valu^ is then added to a 16-bit accumulator. 
The output of the accumulator is rounded to 8 bits to form the number sequence. In the 
step mode, the compositing engine uses an algorithm similar to Bresenham's line drawing 
10 algorithm, as described by the following pseudo code. 

Void linedraw ( length: INTEGER 

) 

{ 

d = 511 - length; 
15 incrE = 510; 

incrNE = 512 - 2*length; 

ramp - 0; 

for (i=0; i(length; 

{ 

20 ifd(=0then 

d += incrE; 

else { 

d incrNE; 
ramp++; 

25 } 
} 

} 

After that, the following equation is calculated to generate the blend from the 

ramp. 

30 Blend = ((end - start) x ramp/255) + start 

The division by 255 is rounded. The above equation requires 2 adders and a 
block that "pre-multiplies" (end-start) by ramp for each channel. 
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Another image processing operation that the main data path unit 242 is able to 
perform is general color space conversion. Generalized Color Space Conversion 
(GCSC) uses piecewize tri-linear interpolation to find out the output color value. 
Preferably, conversion is from a three dimensional input space to one or four dimensional 
output space. 

In some cases, there is a problem with the accuracy of tri-linear interpolation at 
the edges of the color gamut. This problem is most noticeable in printing devices that 
have high sensitivity near an edge of the gamut; To overcome this problem, GCSC can 
optionally be calculated in an expanded output color space and then scaled and clamped 
to the appropriate range using the formula in equation: 

0 ifx(63 
out = 2(x-64) if(64(x(191) 
255 if(!92(x) 

Yet other image processing operations that the preferred embodiment is able to 
perform are image transformation and convolution. In image transformation, the source 
image is scaled, rotated, or skewed to form the destination image. In convolution, the 
source image pixels are sampled with a convolution matrix to provide the destination 
image. To construct a scanline in the destination image, the following steps are 
required: 

1 ) Perform an inverse transform of the scanline in the destination image back to 
the source image as illustrated in Fig, 135* This tells what pixels in the source image are 
needed to construct that scanline in the destination image. 

2) Decompress the necessary portions of the source image* 

3) Inverse-transform the starting x and y coordinates, horizontal and vertical 
subsompling distances in the destination image back to source image. 

4) Pass all these information to the processing units which performs the 
necessary sub-sampling and/or interpolation to construct the output image pixel by pixel. 

The calculations to work out which parts of the source image are relevant, sub- 
sampling frequencies to use, etc, are performed by the host application. Sub-sampling, 
interpolation, and writing the pixels into the destination image memory are done by the 
preferred embodiments. ! j ; 
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Fig. 136 shows a block diagram of the steps required to calculate the value for a 
destination pixel. la general, the computation-intensive part is the bi-Linear interpolation. 
The block diagram in Fig. 136 assumes that all Ihe necessary source image pixels are 
available. 

5 The final step in calculating a destination pixel is to add together all the possibly 

bt-linearly interpolated sub-samples from the source image. These values are given 
different weights. 

Fig. 137 illustrates a block diagram of the image transformation engine that can 

be derived from suitable settings within the main data path unit 242> Image 
10 transformation engine 1830 includes address generator 1831, pre-muitiplier 1832, 

interpolator 1833, accumulator 1834, and logic for rounding, clamping and finding 

absolute value 1835. 

Address generator 1831 is responsible for generating x and y coordinates of the 

source image which are needed to construct a destination pixel. It also generates 
15 addresses to obtain index offsets from an input index table 1815 and pixels from image 

1810. Before address generator 1831 begins generating x and y coordinates in the source 

image, it reads in a kernel descriptor. These arc two formats of kernel descriptors. 

They are shown in Fig. 138. The kernel descriptor describes: 

1) Source image start coordinates (unsigned fixed point, 24.24 resolution). 
20 Location (0,0) is at the top left of the image. 

2) Horizontal and vertical sub-sample deltas (2's complement fixed point, 
24.24 resolution). 

3) a 3 bit bp field defining the location of the binary point within the Gxed 
point matrix coefficients, The definition and interpretation of the bp field is shown in 

25 Fig. 150. 

4) Accumulation matrix coefficients. These are of "variable" point resolution 
of 20 binary places (2's complement), with the location of the binary point implicitly 
specified by the bp field. 

5) an rl field that indicates the remaining number of words in the kernel 
30 descriptor. This value is equal to the number of rows times the number of columns 

minus 1. 

For the short kernel descriptor, apart from the integer part of start x coordinate, 
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starting x coordinate fraction <-0, 
starting y coordinate <-0, 
horizontal delta <-1.0, 
vertical ddtaol.O. 

5 After address generator 1831 is configured it calculates the current coordinates. 

It does this in two different ways, depending on the dimensions of the subsaraple matrix. 
If the dimensions of the subsample matrix are lxl, address generator 1831 adds the 
horizontal delta to the current coordinates until it has generated enough coordinates. 

If the dimensions of the subsample matrix are not lxl, address generator 1831 

10 adds the horizontal delta to the current coordinates until one row of the matrix is finished. 
After that, address generator 1831 adds the vertical delta to the current coordinates to find 
the coordinates on the next row. After that, address generator 1831 subtracts the 
horizontal delta from the current coordinates to find the next coordinates, until one more 
row is finished. After that, address generator 1831 adds the vertical delta to the current 

15 coordinates and the procedure is repeated again. Top diagram in Fig. 150 illustrates this 
method of accessing the matrix. Using this scheme, the matrix is traversed in a zig-zag 
way, and fewer registers are required since the current x and y coordinates are calculated 
using the above method, the accumulation matrix coefficients must be listed in the kernel 
descriptor in the same order. 

20 After generating the current coordinates, the address generator 1831 adds the y 

coordinate to the index table base address to get the address to the index table. (In case 
when source pixels are interpolated, address generator 1831 needs to obtain the next index 
table entry as well.) The index table base address should point to the index table entry 
for y + 0. After obtaining the index offeet from the index table, the address generator 

25 1831 adds that to the x coordinate. The sum is used to get 1 pixel from the source image 
(or 2 if source pixels are interpolated). In case when source pixels are interpolated, the 
address generator 1831 adds the x coordinates to the next index offset, and two more 
pixels are obtained. 

Convolution uses a similar method to generate coordinates to image 



30 transformation. The only difference is that in convolution, the start coordinates of the 
matrix for the next output pixel is one horizontal delta away from the starting coordinates 
of the matrix for the previous pixel. In image transformation, the starting coordinates of 
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the matrix for the next pixel is one horizontal delta away from the coordinates of the top 
right pixel in the matrix for the previous output pixel. 

The middle diagrams in Fig. 139 illustrates this difference. 

Pre-multiplier 1832 multiplies the color channels with the opacity channel of the 
5 pixel it required. 

Interpolator 1832 interpolates between source pixels to find the true color of the 
pixel required. It gets two pixels from the source image memory at all times. Then it 
interpolates between those two pixels using the fraction part of the current x coordinate 
and puts the result in a register. After that, it obtains the two pixels on the next row from 
10 the source image memory. Then it interpolates between those two pixels using the same 
x fraction. After that, interpolator 1833 uses the fraction part of the current y coordinate 
to interpolate between this interpolated result and the last interpolated result. 

Accumulator 1834 does two things: 

1) it multiplies the matrix coefficients with the pixel, and 
15 2) it accumulates the product above until the whole matrix is traversed. Then 

it outputs a value to the next stage. 

Preferably the accumulator 1834 can be initialized with 0 or a special value on a 
channel-by-channel basis. 

Block 1835 rounds the output of accumulator 1834> then clamps any underflows 
20 or overflows to the maximum and minimum values if required, and finds the absolute 
value of the output if required The location of the binary point within the output of the 
accumulator is specified by the bp field in the kernel descriptor. The bp field 
indicates the number of leading bits in the accumulated result to discard. This is shown 
in the bottom diagram of Fig. 139. Note that the accumulated value is treated as a signed 
25 two's complement number. 

Yet another image processing operation that the main data path unit 242 can 
perform is matrix multiplication. Matrix Multiplication is used for color space 
conversion where an affine relationship exists between the two spaces. This is distinct 
from General Color Space Conversion (based on tri-linear interpolation). 
30 The result of Matrix Multiplication is defined by the following equation: 
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where r j is the result pixel and aj is the A operand pixel. Matrix must be 5 columns by 4 
rows. 

5 Fig. 140 illustrates a block diagram of the multiplier-adders that perform the 

matrix multiplication in the main data path unit 242. It includes multipliers to multiply 
the matrix coefficients with the pixel channels, adders to add the products together, and 
logic to clamp and find the absolute value of the output if required. 

The complete matrix multiplication takes 2 clock cycles to complete. At each 
10 cycle the multiplexers are configured differently to select the right data for the multipliers 
and adders. 

At cycle 0, the least significant 2 bytes of the pixel are selected by the 
multiplexers 1851, 1852. They then multiply the coefficients on the left 2 columns of 
the matrix, i.e. the matrix coefficients on line 0 in the cache. The results of the 
15 multiplication, and the constant term in the matrix, are then added together and stored. 

At cycle 1, the more significant 2 bytes of the pixel arc selected by the top 
multiplexers. They then multiply the coefficients on the right 2 columns of the matrix. 

The result of the multiplication is then added 1854 to the result of the last cycle. 
The sum of the adder is men rounded 1855 to 8 bits. 
20 The 'operand logic' 1856 rearranges the outputs of the multipliers to form four of 

the inputs of the adder 1854. It rearranges the outputs of the multipliers so that they can 
be added together to form the true product of the 24-bit coefficient and 8-bit pixel 
component 

The r AC (Absolute value-clamp/wrap) logic' 1855 firstly rounds off the bottom 
25 12 bits of the adder output. It then finds the absolute value of the rounded result if it is 
set to do so. After that it clamps or wraps the result according to how it is set up. If the 
'AC logic* is set to clamp, it forces all values less than 0 to 0 and all values more than 255 
to 255. If the 4 AC logic' is set to wrap, the lower 8 bits of the integer part is passed to 
the output. : : : 
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Apart from the image processing operations above, the main data path unit 242 
can be configured to perform other operations. 

The foregoing description provides a computer architecture that is capable of 
performing various image processing operations at high speed, while the cost is reduced 

5 by design reuse. The computer architect; re described is also highly flexible, allowing miy 
external programming agent with intimate knowledge of the architecture to configure it to 
perform image processing operations that were Dot initially expected. Also, as the core 
of the design mainly comprizes a number of those multifunction blocks, the design effort 
is reduced significantly. 

10 3*18*6 Data Cache Controller and Cache 

The data cache controller 240 maintains a four-kilobyte read data cache 230 within 
the coprocessor 224. The data cache 230 is arranged as a direct mapped RAM cache, 
where any one of a group of lines of the same length in external memory can be mapped 
directly to the same line of the same length in cache memory 230 (Kg 2). This line in 

15 cache memory is commonly referred to as a cache-line. The cache memory comprizes a 
multiple number of such cache-lines. 

The data cache controller 240 services data requests from die two operand 
organizers 247, 248. It first checks to see if the data is resident in cache 230. If not, 
data will be fetched from external memory. The data cache controller 240 has a 

zu programmable address generator, which enables the data cache controller 240 to operate 
in a number of different addressing modes. There are also special addressing modes 
where the address of the data requested is generated by the data cache controller 240. 
The modes can also involve supplying up to eight words (256 bits) of data to the operand 
organizers 247, 248 simultaneously. 

25 The cache RAM is organized as 8 separately addressable memory banks. This is 

needed for some of the special addressing modes where data from each bank (which is 
addressed by a different line address) is retrieved and packed into 256 bits. This 
arrangement also allows up to eight 32-bits requests to be serviced simultaneously if they 
come from different banks. 

30 The cache operates in the following modes, which will be discussed in more detail 

later. Preferably, it is possible to automatically fill the entire cache if this is desired. 

1. Normal Mode 

2. Single Output General Color Space Conversion Mode 
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3, Multiple Output General Color Space Conversion Mode 

4. JPEG Encoding Mode 

5, Slow JPEG Decoding Mode 

6. Matrix Multiplication Mode 
5 7. Disabled Mode 

8. Invalidate Mode 

Fig. 141 shows the address, data and control flow of the data cache controller 240 
and data cache 230 shown in figure 2. 

The data cache 230, consists of a direct mapped cache of the type previously 

10 discussed. The data cache controller 240, consists of a tag memory 1872 having a tag 
entry for each cache-line, which tag entry comprizes the most significant part of the 
external memory address that the cache-line is currently mapped to. There is also a line 
valid status memory 1873 to indicate whether the current cache-line is valid. All cache- 
lines are initially invalid. 

15 The data cache controller 240 can service data requests from operand organizer B 

247 (Fig 2) and operand organizer C 248 (Fig. 2) simultaneously via the operand bus 
interface 1875, In operation, one or both of the operand organizers 247 or 248 (Fig. 2), 
supplies an index 1874 and asserts a data request signal 1876. The address generator 
1881 generates one or more complete external addresses 1877 in response to the index 

20 1 874. A cache controller 1878 determines if the requested data is present in cache 230 
by checking the tag memory 1872 entries for the tag addresses of the generated addresses 
1877 and checking the line valid status memory 1873 for the validity of the relevant 
cache-line(s). If the requested data is present in cache memory 230, an acknowledgment 
signal 1879 is supplied to the relevant operand organizer 247 or 248 together with the 

25 requested data 1880. If the requested data is not present in the cache 230, the requested 
data 1870 is fetched from external memory, via an input bus interface 1871 and the input 
interface switch 252 (Fig. 2). Hie data 1870 is fetched by asserting a request signal 1882 
and supplying the generated address(es) 1877 of the requested data 1870. An 
acknowledgement signal 1883 and the requested data 1870 are then sent to the cache 

30 controller 1878 and the cache memory 230 respectively. The relevant cache-line(s) of 
the cache memory 230 are then updated with the new data 1870. The tag addresses of 
the hew cache-line(s) are also written into tag memory 1872, and the line valid status 
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1873 for the new cache-line(s) are asserted. An acknowledgment signal 1879 is then 
sent to the relevant operand organizer 247 or 248 (Fig 2) together with the data 1870. 

Turning now to Fig. 142, which shows the memory organization of the data cache 
230. The data cache 230 is arranged as a direct mapped cache with 128 cache-lines 
5 C0,...,C127 and a cache-line length of 32 bytes. The cache RAM consists of 8 
separately addressable memory banks B0»...,B7, each having 128 bank-lines of 32 bits, 
with each cache-line Ci consisting of the corresponding 8 bank-lines BQL,.,B7i of the 8 
memory banks BO, . . . B7. 

The composition of the generated complete external memory address is shown in 

10 Fig. 143. The generated address is a 32-bit word having a 20-bit tag address, a 7-bit line 
address, a 3-bit bank address and a 2-bit byte address. The 20-bit tag address is used for 
comparing the lag address with the tag stored in the tag memory 1872. The 7-bit line 
address is used for addressing the relevant cache-line in the cache memory 1870. The 3- 
bit bank address is used for addressing the relevant bank of the memory banks of the 

15 cache memory 1870. The 2-bit byte address is used for addressing the relevant byte in 
the 32-bit bank line. 

Turning now to Fig. 144, which shows a block diagram of the data cache controller 
240 and data cache 230 arrangement. In this arrangement, a 128 by 256 bit RAM makes 
up the cache memory 230, and as noted previously is organized as 8 separately 

20 addressable memory banks of 128 by 32 bits. This RAM has a common write enable 
port (write), a common write address port (writeoddr) and a common write data port 
(write_data). The RAM also has a read enable port (read), eight read address ports 
(readjaddr) and eight read data output ports (read_data). A write enable sigpal is 
generated by the cache controller block 1878 for supply to the common write enable port 

25 (write) for simultaneously enabling writing to all of the memory banks of the cache 
memory 230. When required, the data cache 230 is updated by one or more lines of data 
from external memory via the common write data port (write_data). A line of data is 
written utilizing die 8:1 multiplexer MUX supplying the line address to the write address 
port (write addr). The 8:1 multiplexer MUX selects the line address from the generated 

30 external addresses under the control of the data cache controller (addrjelect). A read 
enable signal is generated by the cache controller block 1878 for supply to the common 
read port (read) for simultaneously enabling reading of all the memory banks of cache 
memory 230. In this way, eight different bank-lines of data can be simultaneously read 
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from eight read data ports (read_data) in response to respective line addresses supplied on 
the eight read address ports (read_addr) of the memory banks of the cache memory 230. 

Each bank of the cache memory 230 has its own programmable address generator 
1881. This allows eight different locations to be simultaneously accessed from the 
5 respective eight banks of memory. Each address generator 1881 has a dec-mode input 
for setting the mode of operation of the address generator 1881, an index-packet input, a 
base-address input and an address output. The modes of operation of the programmable 
address generator 1881 include 

(a) Random access mode where a signal on the dec-mode input sets each address 
10 generator 1881 to the random access mode and complete external memory address(es) 

are supplied on the index-packet input(s) and outputted on the address output of one or 
more of the address generators 1881; and 

(b) JPEG encoding and decoding, color space conversion, and matrix 
multiplication modes, where a signal on the dec-mode input sets each address generator 

15 1881 to the appropriate mode. In these mudes. each address generator 1881 receives an 
index on the index-packet input and generates an index address. The index addresses are 
then added to a fixed base address supplied on the base-address input resulting in a 
complete external memory address which is then outputted on the address output 
Depending upon the mode of operation, the address generators are able to generate up to 

20 eight different complete external memory addresses. 

The eight address generators 1881 consist of eight different combinational logic 
circuits each having as their inputs; a base-address, a dec-mode and an index and each 
having a complete external memory address as an output. 

A base-address register 1885 stores the current base address that is combined with 

25 the index packet and a dee-mode register 1888 stores the current operational mode (dec- 
mode) of the data cache controller 240. 

The tag memory 1872 comprizes one block of 128 by 20 bit, mulli- port RAM. Tins 
RAM has one write port (update-line-addr), one write enable port (write), eight read ports 
(t^d0line-addr,....,read71ine-addi) and eight read output ports (tag0_data,.^tag7_data). 

30 This enables eight simultaneous lookups on the ports (read01me-addr,... l read71ine-addr) 
by the eight address generators 1881 to determine, for each line address of the one or 
more generated memory addresses, the tag addresses currently stored for those lines. 
The current tag addresses for those lines are outputted on the ports (tag0-data r ..tag7-data) 
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to the tag comparator 1886. When required, a tag write signal is generated by the cache 
controller block 1878 for supply to the write port (write) of the tag memory 1872 to 
enable writing to the tag memory 1872 on the port (update-line-addr). 

A 128-bit line valid memory 1873 contains the line valid status for each cache-line 

5 of the cache memory 230. This is 128 by 1 bit memory with one write pc. \ (u: ' -.ie-li/ e- 
addr), one write enable port (update), eight read ports (readOIine-addr,„.,read71ine-addr) 
and eight read output ports (LjnevalicO v .. f !ir:evalid7). In a similar manner to the lag 
memory, this allows eight simultaneous lookups on the ports (read01ine-addr,...,read71ine- 
addr) by the eight address generators 1881 to determine, for each line address of the one 

10 or more generated memory addresses, the line valid status bits currently stored for those 
lines. The current line valid bits for those lines arc outputted on the ports 
(Iinevalid0,.. n linevalid7) to the tag comparator 1886, When required, a write signal is 
generated by the cache controller block 1878 for supply to the. write port (update) of the 
line valid status memory 1873 to enable writing to the line valid status memory 1873 on 

15 the port (update-line-addr). 

The tag comparator block 1886 consists of eight identical tag comparators having; 
tag_data inputs for respectively receiving the tag addresses currently stored in tag memory 
1872 at those lines accessed by the line addresses of the currently generated complete 
external addresses, tag_addr inputs for respectively receiving the tag addresses of the 

20 currently generated complete external memory addresses, a dccjnput for receiving the 
current operational mode signal (decjnode) for setting the parts of the tag addresses to be 
compared, and a line jvalid input for receiving the line valid status bits currently stored in 
the line valid status memory 1873 at those lines accessed by the line addresses of the 
currently generated complete external memory addresses. The comparator block 1886 

25 has eight hit outputs for each of the eight address generators 1881. A hit signal is 
asserted when the tag address of the generated complete external memory address 
matches the contents of the tag memory 1872 at the location accessed by the line address 
of the generated complete external memory address, and the line valid status bit 1873 for 
that line is asserted. Li this particular embodiment, the data structures stored in external 



30 memory are small, and hence the most significant bits of the tag addresses are the same. 
Thus it is preferable to compare only those least significant bits of the tag addresses which 
may vary. This is achieved by the current operational mode signal (decjnode) setting 
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the tag comparator 1886 for comparing those least significant bits of the tag addresses 
which may vary. 

The cache controller 1878 accepts a request (procjeq) 1876 from the operand B 
247 or operand C 248 and acknowledges (proc_ack) 1879 this request if the data is 
available in cache memory 230. Depending on the mode of operation, up to eight 
differently addressed data items may be requested, one from each of the eight banks of 
cache memory 230. The requested data is available in cache memory 230 when the tag 
comparator 1886 asserts a hit for that line of memory. The cache controller 1878 in 
response to the asserted hit signal (hit0,...,hit7) generates a read enable signal on the port 
(cache_read) for enabling reading of those cache-lines for which the hit signal has been 
asserted. When a request (procjeq) 1876 is asserted, but not the hit signal (hit0,,.,»hit7), 
a generated request (cxt_req) 1890 is sent to the external memory together with the 
complete external memory address for that cache-line of data. This cache-line is written 
into the eight banks of cache memory 230 via the input (ext_data) when it is available 
from the external memory. When this happens, the tag information is also written into 
the tag memory 1886 at that line address, and the line status bit 1873 for that line asserted. 

Data from the eight banks of cache memory 230 is then outputted through a series 
of multiplexers in a data organizer 1892, so that data is positioned in a predetermined 
manner in an output data packet 1894. In one operational mode, the data organizer 1892 
is able to select and output eight 8-bit words from the respective eight 32-bit words 
outputted from the eight memory banks by utilising the current operational mode signal 
(dcc_mode) and the byte addresses (bytejaddr) of the current generated complete external 
memory addresses. In another operational mode, the data organizer 1892 directly 
outputs the eight 32-bit words outputted from the eight memory banks. As noted 
previously, the data organizer arranges this data in a predetermined manner for output. 

A request would comprize the following steps: 

1) The processing unit requests a packet of data by supplying an address to the 
processing unit interface of the cache controller 1878; 

2) Each of Ihe eight address generator units 1881 then generate a separate 
address for each block of cache memory depending on the mode of operation; 

3) The Tag portion of each of the generated addresses is then compared to the 
Tag address stored in the four blocks of triple-port Tag memory 1886 and addressed by 
each of the corresponding line part of the eight generated addresses; 
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4) If they match, and the line valid status 1873 for that line is also asserted, the 
data requested for that block of memory is deemed to be resident in the said cache 
memory 230; 

5) Data that is not resident is fetched via the external bus 1890 and all eight 
blocks of the cache memory 230 are updated with that line of data from external memory. 
The Tag address of the new data is then written to the Tag memory 1886 at the said line 
address, ?~A the line valid status 1373 for that line asserted; 

6) When all requested data items are resident in cache memory 230, it is 
presented to the processing unit in a predetermined packet format. 

As previously noted, all the modules (Fig 2) of the coproccessor 224 include a 
standard cBus interface 303 (Fig 20). For more details on the standard cBus interface 
registers for the data cache controller 240 and cache 230, reference is made to pages B42 
to B46 of Appendix B. The settings in these registers control the operation of the data 
controller 240. For the sake of simplicity only two of these registers are shown in Figure 
153, i.e. base_address and dcc_mode. 

Once the data cache controller 240 and data cache 230 are enabled, the data cache 
controller intially operates in the normal mode with all cache lines invalid. At the end of 
an instruction, the data cache controller 240 and cache 230 always reverts to the normal 
mode of operation. In all of the following modes except the "Invalidate" mode, there is 
an "Auto-fill and validate" option. By setting a bit in the dcc_cfg2 register, it is possible 
to fill the entire cache starting at the address stored in the base_address register. During 
this operation, the data requests from the operand organizers B and C 247,248 are locked 
out until the operation is complete. The cache is validated at the end of this operation, 
a. Normal Cache Mode 

In this mode, the two operand organizers supply the complete external memory 
addresses of the data requested The address generator 1881 outputs the complete 
external memory addresses which are then checked independently using the internal tag 
memory 1872 to see that if the data requested is resident in die memory cache 230. If 
both requested data items are not in cache 230, data will be requested from the input 
interface switch 252. Round Robin scheduling will be implemented to service persistent 
simultaneous requests. 



( 457 ) 



1 - 8 5 9 6 9 



- 164-23- 

For simultaneous requests, if one of the data items is resident in cache, it will be 
placed on the least significant 32 bits of each requestors data bus. The other data will be 
requested externally via the input interface switch, 

b. The Single Output General Color Space Conversion Mode 

5 In this mode, the request comes from operand organizer B in the form of a 12-bit 

byte address. The requested data items are 8-bit color output values as previously 
discussed with reference to Fig. 60. The 12-bit address is fed to the index_packet inputs 
of the address generators 1881 and the eight address generators 1881 generate eight 
different 32-bit complete external memory addresses of the format shown in figure 96. 

10 The bank, line and byte addresses of the generated complete addresses are determined in 
accordance with Table 12 and Fig. 61. The external memory address is interpreted as 
eight 9-bit line and byte addresses, which are used to address a byte from each of the eight 
banks of RAM. The cache is accessed to obtain the eight byte values from each bank 
which are returned to the operand organizers for subsequent interpolation by the main 

15 data path 242 in accordance with the principles previously discussed with reference to Fig. 
60. As the single output color value table is able to fit entirely within the cache memory 
230, it is preferable to load the entire single output color value table within the cache 
memory 230 prior to enabling the single color conversion mode. 

c. Multiple Output General Color Space Conversion Mode 

20 In this mode, a 12-bit word address is received from operand organizer B 247. 

The requested data items are 32-bit color output values as previously discussed with 
reference to Fig. 62, The 12-bit address is fed to the index_packet inputs of the address 
generators 1881 and the eight address generators 1881 generate eight different 32-bit 
complete external memory addresses of the format shown in figure 96. The line and tag 

25 addresses of the complete external memory addresses are determined in accordance with 
table 12 and Fig. 63. The completed external memory address is interpreted as eight 9- 
bit addresses with the 9-bit address being decomposed into a 7-bit line address and a 2-bit 
tag address as discussed previously with reference to Fig. 63. Upon the tag address not 
being found, the cache stalls while the appropriate data is loaded from the input interface 

30 switch 252 (Fig 2). Upon the data being available, the output data is returned to the 
operand organizers. 
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d. JPEG Encoding Mode 

In this mode, the necessary tables for JPEG encoding and other operational sub-seu 
are stored in each bank of cache RAM. The storage of tables being previously described 
in the previous discussion of the JPEG encoding mode (Tables 14 and 1 6). 
5 c Slow JPE Z Decoding Mode 

In this mode, the data is organized in accordance with Table 17. 
£ Matrix Multiplication Mode 

In this mode, the cache is utilized to access. 256 byte lines of data. 

g. Disabled Mode 

10 In this mode, all requests are passed through to the input interface switch 252. 

h. Invalidate Mode 

In this mode* the contents of the entire cache are invalidated by clearing all the line 
valid status bits, 

3.18.7 Input Interface Switch 
15 Returning again to Fig. 2, the input interface switch 252 performs the function of 

arbitrating data requests from the pixel organizer 246, the data cache controller 240 and 
the instruction controller 235. Further, the input interface switch 252 transmits 
addresses and data as required to the external interface controller 238 and local memory 
controller 236. 

20 The input interface switch 252 stores in one of its configuration register the base 

address or the memory object in the host memory map. This is a virtual address that 
must be aligned on a page boundary, hence 20 address bits are required. For each 
request made by the pixel organizer, data cache controller, instruction controller, the input 
interface switch 252 first subtracts the co-processor's base address bits from the most 

25 significant 6 bits of the start address of the data. If the result is negative, or the most 
significant 6 bits of the result are non-zero, this indicates that the desired destination is the 
PQ bus. 

If the most significant 6 bits of the result are zero, this indicates that the data 
maps to a co-processor's memory location. The input interface switch 252 then needs to 
30 check the next 3 bits to determine if the co-processor's location is legal or not 
The legal co-processor's locations that may act as a source of data are: - 
1) 16 Mbytes occupied by the Generic interface, beginning at an offeet of 
0x01000000 from the co-processor's base address. 
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2) 32 Mbytes occupied by the local memory controller (LMC), starting at an 
offset of 0x02000000 from the base address of the co-processor's memory object. 

Requests that map to an illegal co-processor's location are flagged as errors by 
the Input Interface Switch. 
5 The PCI bus is the source of data corresponding to any addresses that map 

outside of the range occupied by the co-processor's memory object. An i-source signal is 
used by the input interface switch to indicate to the EIC whether requested data is to 
originate from the PCI bus or the Generic interface. 

After the address decoding process, legal requests are routed to the appropriate 

10 IBus interface when the bus is free. The EIC or LMC is busy with a data transaction to 
the input interface switch when they have their i-ack signal asserted. However, the input 
interface switch does not keep a count for the number of incoming words, and so must 
monitor the i-oe signal, controlled by the pixel organizer, instruction controller or data 
cache controller, in order to determine when the current data transaction has completed. 

15 The input interface switch 252 must arbitrate between three modules: the pixel 

organizer, data cache controller and instruction controller. All of these modules are able 
to request data simultaneously, but not all requests can be instantly met since there are 
only two physical resources. The arbitration scheme used by the input interface switch is 
priority-based and programmable. Control bits within a configuration register of the 

20 input interface switch specify the relative priorities of the instruction controller, data 
cache controller and pixel organizer. A request from the module with the lower priority 
is granted when neither of the other two modules are requesting access to the same 
resource as it is. Assigning the same priority to at least two of the requesters results in 
the use of a round robin scheme to deduce the new winners. 

25 As immediate access to a resource may not be possible, the input interface switch 

needs to store the address, burst length and whether to prefetch data provided by each 
requester. For any given resource, the arbitration process only needs to determine a new 
winner when there is not an IBus transaction in progress. 

Turning to Fig. 145, there is illustrated the instruction interface switch 252 in 



30 more detail. The switch 252 includes the standard CBus interface and register file 860 in 
addition to two IBus transceivers 861 and 862 between an address decoder 863 and arbiter 
864. 
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The address decoder 863 performs address decoding operations for requests 
received from the pixel organizer, data cache controller and instruction controller. Th 
address decoder 863 checks the address is a legal one and performs any address re- 
mapping required. The arbiter 864 decides which request to pass from one IBus 

5 transceiver 661 to a second IBus transceiver 862. Preferra. !y, the priority systov ; s 
programmable. 

The IBus transceivers 861. 862 contain all the necessary mukirjlexine/ 
demultiplexing and tristate buffering to enable communication over the various interlaces 
to the input interface switch. 

10 318.8 Local Memory Controller 

Returning again to Fig. 2, the local memory controller 236 is responsible for all 
aspects of controlling the local memory and handling access requests between the local 
memory and modules within the co-processor. The local memory controller 236 
responds to write requests from the result organizer 249 and read requests from the input 

15 interface switch 252. Additionally, it also responds to both read and write requests from 
the peripheral interface controller 237 and the usual global CBus input. The local 
memory controller utilizes a programmable priority system and further utilises FIFO 
buffers to maximize throughput 

In the present invention, a multi-port burst dynamic memory controller is utilized 

20 in addition to using First-In-First-Out (FIFO) buffers to de-couple the ports from a 
memory array. 

Fig. 146 depicts a block diagram of a four-port burst dynamic memory controller 
according to a first embodiment of the present invention. The circuit includes two write 
ports (A 1944 and B 1946) and two read ports (C 1948 and D 19S0) that require access to 

25 a memory array 1910. The data paths from the two write ports pass through separate 
FIFOs 1920, 1922 and to the memory array 1910 yia a multiplexer 1912, while the data 
paths of the read ports 1948, 1950 pass from the memory array 1910 via separate FIFOs 
1936, 1938. A central controller 1932 coordinates all port accesses as well as driving all 
the control signals necessary to interface to the dynamic memory 1910. A refresh 

30 counter 1934 determines when dynamic memory refresh cycles for the memory array 
1910 are required and coordinates these with the controller 1932. 

Preferably, the data is read from and written to the memory array 1910 at twice 
the rate that data is transferred from the write ports 1944, 1946 to the FIFOs 1920, 1922 
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or from the FIFOs 1936, 193b* to the read ports 1948, 1950. This results in as little time 
as possible being taken up doing transfers to or From the memory array 1910 (which is the 
bottleneck of any memory system) relative to the time taken to transfer data through the 
write and read ports 1944, 1946, 1948, 1950. 

5 Data is written into the memory array 1910 via either one of the write ports 1944, 

1946. The circuits connected to the write ports 1944 3 1946 see only a FIFO 1920, 1922 
which are initially empty. Data transfers through the write ports 1944, 1946 proceed 
unimpeded until the FIFO 1920, 1922 is filled, or the burst is ended. When data is first 
written into the FIFO 1920, 1922, the controller 1932 arbitrates with the other ports for 

10 the DRAM access. When access is granted, data is read out of the FIFO 1920, 1922 at 
the higher rate and written into the memory array 1910. A burst write cycle to DRAM 
1910 is only initiated when a preset number of data words have been stored in the FIFO 
1920, 1922, or when the burst from the write port ends. In either case, the burst to 
DRAM 1910 proceeds when granted and continue until the FIFO 1920, 1922 is emptied, 

15 or there is a cycle request from a higher priority port. In either event, data continues to 
be written into the FIFO 1920, 1922 from the write port without hindrance, until the FIFO 
is filled, or until the burst ends and a new burse is started In the latter case, the new burst 
cannot proceed until the previous burst has been emptied from the FIFO 1920, 1922 and 
written to the DRAM 1910. In the former case, data transfers recommences as soon as 

2U the first word is read out of the FIFO 1920, 1922 and written to DRAM 1910. Due to 
the higher rate of data transfers out of the FIFO 1920, 1922, it is only possible for the 
write port 1944, 1940 to stall if the controller 1832 is interrupted with cycle requests from 
the other ports. Any interruption to the data transfers from the write ports 1944, 1946 to 
the FIFOs 1920, 1922 is preferably kept to a minimum. 

25 The read ports 1948, 1950 operate in a converse fashion. When a read port 1948, 

1950 initiates a read request, a DRAM cycle is immediately requested. When granted, 
the memory array 1910 is read and data is written into the corresponding FIFO 1 936, 1938. 
As soon as the first data word is written into the FIFO 1936, 1938, it is available for read- 
out by the read port 1948, 1950. Thus there is an initial delay in obtaining the first 

30 datum word but after that there is a high likelihood that there are no further delays in 
retrieving the successive data words. DRAM reads will be terminated when a higher 
priority DRAM request is received, or if the read FIFO 1936, 1938 becomes full, or when 
the read port 1948, 1950 requires no more data. Once the read has been terminated in 
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this way, it is not restarted until there is room in the FIFO 1936, 1938 foi a preset number 
of data words. Once the read port terminates the cycle, any data remaining in the FIFO 
1936, 1938 is discarded. 

In order to keep DRAM control overheads to a minimum, rearbitration for the 
5 DRAM access h restricted so that bursts cannot be interrupted until a preset number of 
data words have been transferred (or until the corresponding write FIFO 1920, 1922 is 
emptied, or read FIFO 1936, 1938 is filled). 

Each of the access ports 1944, 1946, 1948, 1950 has an associated burst start 
address which is latched in a counter 1942 at the start of the burst This counter holds the 

10 current address for transactions on that port so that, should the transfer be interrupted, it 
can be resumed at any time at the correct memory address. Only the address for the 
currently active DRAM cycle is selected by multiplexer 1940 and passed on to the row 
address counter 1916 and column address counter 1918. The low order N bits of address 
are inputted to the column counter 1918 while the higher order address bits arc inputted to 

15 the row counter 1916. Multiplexer 1914 outputs row addresses from the row counter 1916 
to the memory array 1910 during the row address time of the DRAM and passes column 
addresses from the column counter 1918 during column address time of the DRAM. The 
row address counter 1916 and the column address counter 1918 are loaded at the start of 
any burst to the memory array DRAM 1910. This is true both at the start of a port cycle 

20 and at the continuation of an interrupted burst. The column address counter 1918 is 
incremented after each transfer to memory has taken place while the row address counter 
1916 is incremented when the column address counter 1918 rolls over to a count of zero. 
When the latter happens, the burst must be terminated and restarted at the new row 
address. 

25 In the preferred embodiment it is assumed that memory array 1910 comprizes 4 x 

8 bit byte lines making up a 32 bits per word. Further there is associated with each write 
port 1944, 1946 a set of four byte write enable signals 1950, 1952 which individually 
allow data to be written to each 8-bil portion of each 32-bit data word in the memory array 
1910. Since it is possible to arbitrarily mask the writing of data to any byte within each 

30 word that is written to the memory array 1910, it is necessary to store the write enable 
information along with each data word in corresponding FIFOs 1926, 1928. These 
FIFOs 1926, 1928 are controlled by the same signals that control the write FIFOs 1920, 
1922 but are only 4 bits wide instead of the 32 bits required for the write data in FIFOs () 
\: ■ ■ ,■ - ; ;;• y$ ' ; 
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1920, 1922. In like fashion, multiplexer 1930 is controlled in the same manner as the 
multiplexer 1912. The selected byte write enables are inputted to the controller 1932 
which uses the information to selectively enable or disable writing to the addressed word 
in the memory array 1910 in synchronization with the write data being inputted to the 

5 memory array 1 910 by way of multiplexer 1912. 

The arrangement of Fig, 146 operates under the control of the controller 1932. 
Fig. 147 is a state machine diagram depicting the detail of operation of the controller 1932 
of Fig. 146. After power up and at the completion of reset the state machine is forced 
into state IDLE 100 in which all DRAM control signals are driven inactive (high) and 

10 multiplexer 1914 drives row addresses to the DRAM array 1910. When a refresh or 
cycle request is detected, the transition is made to state RASDEL1 1962. On the next 
clock edge the transition to state RASDEL2 1964 is made. On the next clock edge, if the 
cycle request and refresh have gone away, the state machine returns to state IDLE 1900, 
othervme, when the DRAM tRP (RAS precharge timing constraint) period has been 

15 satisfied, the transition to state RASON 1966 is made at which time the row address 
strobe signal, RAS, is asserted low. After tRCD (RAS to CAS delay timing constraint) 
has been satisfied, the transition to state COL 1968 is made, in which the multiplexer 
1914 is switched over to select column addresses for inputting to the DRAM array 1910. 
On the next clock edge the transition to state CASON 1970 is made and the DRAM 

20 column address strobe (CAS) signal is driven active low. Once the tCAS (CAS active 
timing constraint) has been satisfied, the transition to state CASOFF 1972 is made in 
which the DRAM column address strobe (CAS) is driven inactive high once again. At 
this point, if further data words are to be transferred and a higher priority cycle request or 
refresh is not pending or if it is too soon to rearbitrate anyway t and once the tCP (CAS 

25 precharge timing constraint) has been satisfied, the transition back to state CASON 1970 
will be made in which the DRAM column address strobe (CAS) is driven active low 
again. If no further data words are to be transferred, or if rearbitrating is taking place 
and a higher priority cycle request or refresh is pending, then the transition is made to 
state RASOFF 1974 instead, providing tRAS (RAS active liming constraint) and tCP 

30 (CAS precharge timing constraint) are both satisfied. In this state the DRAM row 
address strobe (RAS) signal is driven inactive high. On the next clock edge the state 
machine returns to state IDLE 1860 ready to start the next cycle. 
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When in state RASDEL2 1964 and a refresh request is detected, the transition 
will be made to state RCASON 1980 once tRP (RAS precharge timing constraint) has 
been satisfied. In this state DRAM column address strobe is driven active low to start a 
DRAM CAS before RAS refresh cycle. On the next clock edge the transition to state 
5 RRASON 1978 is made in which DRAM row address strobe (RAS) is driven active low. 
When tCAS (CAS active timing constraint) has been met, the transition to state 
POAPO^F 1976 will be made m which DRAM column address strobe fCAS^ w 
inactive high. Once tRAS (RAS active timing. constraint) has been met, the transition to 
state RASOFF 1974 is made in which DRAM row address strobe (RAS) is driven inactive 

1 0 high effectively ending the refresh cycle. The state machine then continues as above for 
a normal DRAWcycle, making the transition back to state IDLE 1960. 

The refresh counter 1934 of Fig. 146 is simply a counter that produces refresh 
request signals at a fixed rate of once per 15 microseconds, or other rate as determined by 
the particular DRAM manufacturer's requirements. When a refresh request is asserted, 

15 it remains asserted until acknowledged by the slate machine of Fig. 147* Hiis 
acknowledgement is made when the state machine enters state RCASON 1980 and 
remains asserted until the slate machine detects the refresh request has been de-asserted. 

In Fig. 148, there is set out in pseudo code form, the operation of the arbitrator 
1924 of Fig. 146. It illustrates the method of determining which of four cycle requesters 

20 is granted access to the memory array 1910, and also a mechanism for modifying the cycle 
requester priorities in order to maintain a fair access regime. Hie symbols used in this 
code are explained in Fig. 149. 

Each requester has 4 bits associated with it that represent that requester's priority. 
The two high order bits are preset to an overall priority by way of configuration values set 

25 in a general configuration register. The two low order bits of priority are held in a 2-bit 
counter that is updated by the arbitrator 24. When determining the victor in an 
arbitration, the arbitrator 1924 simply compares the 4-bit values of each of the requesters 
and grants access to the requestor with the highest value. When a requester is granted a 
cycle its low order 2-bit priority count value is cleared to zero, while all other requesters 

30 with identical high order 2-bit priority values and whose low order 2-bit priority is less 
than the victor's low order 2-bit priority have their low order 2-bit priority counts 
incremented by one. This has the effect of making a requester that has just been granted 
access to the memory array 1910 the lowest priority among requesters with the same 
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priority high order 2-bit value. The priority low order 2-bit value of other requesters 
with priority high order 2-bit value different to that of the winning requester are not 
affected. The high order two bits of priority determine the overall priority of a requester 
while the low order two bits instil a fair arbitration scheme among requesters with 
5 identical high order priority. This scheme allows a number of arbitration schemes to be 
implemented ranging from hard-wired fixed priority (high order two bits of each requester 
unique) through pari rotating and part hard-wired (some high order 2-bit priorities 
different to others, but not all) to strictly fair and rotating (all priority high order 2-bit 
fields the same). 

10 Fig. 149 depicts the structure of the priority bits associated with each requester 

and how the bits are utilized. It also defines the symbols used in Fig. 148. 

In the preferred embodiment, the various FIFOs 1920, 1922, 1938 and 1936 are 
32 bits wide and 32 words deep. This particular depth provides a good compromise 
between efficiency and circuit area consumed. However, the depth may be altered, with 

15 a corresponding change in performance, to suit the needs of any particular application. 

Also, the four port arrangement shown is merely a preferred embodiment. Even 
the provision of a single FIFO buffer between the memory array and either a read or write 
port will provide some benefits. However, the use of multiple read and write ports 
provides the greatest potential speed increase. 

20 3.18.9 Miscellaneous Module 

The miscellaneous module 239 provides clock generation and selection for the 
operation of the co-processor 224, reset synchronization, multiplexing of error and 
interrupt signals by routing of internal diagnostic signals to external pins as required, 
interfacing between the internal and external form of the CBus and multiplexing of 

25 internal and generic Bus signals onto a generic/external CBus output pins. Of course, 
the operation of the miscellaneous module 239 varies in accordance with clocking 
requirements and implementation details depending on the ASIC technology utilized. 
3.18*10 External Interface Controller 

The following described apsects of the invention relate to a method and an 

30 apparatus for providing virtual memory in a host computer system having a co-processor 
that shares the virtual memory. Hie embodiments of the invention seek to provide a co- 
processor able to operate in a virtual memory mode in conjunction with the host 
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[n particular, the co-processor is able to operate in a virtual memory mode of the 
host processor. The co-processor includes a virtuaJ-memory-tc-physical-memory 
mapping device that is able to interrogate the host processor's virtual memory tables, so as 
to map instruction addresses produced by the co-processor into corresponding physical 
5 addresses in the host processor's memory. Preferably, the virtual-mernory-to-physical- 
memory mapping device forms part of a computer graphics co-processor for the 
production of graphical images. The cc-processo r may include a large number of 
modules able to form various complex operations on images. The mapping device is 
responsible for the interaction between the co-processor and the host processor. 

10 The external interface controller (EIC) 238 provides the co-processors interface 

to the PCI Bus and to a generic Bus. It also provides memory management to translate 
between the co-processor's internal virtual address space and the host system physical 
address space. The external interface controller 238 acts as a master on the PCI Bus 
when reading the data from the host memory in response to a request from the input 

15 interface switch 252 and when writing data to host memory in response to a request from 
the result organizer 249. The PCI Bus access is implemented in accordance the well 
known standard with 'TCI Local Bus specification, draft 2.1", PCI special interest group, 
1994. 

The external interface controller 238 arbitrates between simultaneous requests 
20 for PCI transactions from the input interface switch 252 and the result organizer 249. 
The arbitration is preferably configurable. The types of requests received include 
transactions for reading less than one cache line of the host co-processor at a time, reading 
between one and two cache lines of the host and reading two or more cache lines of the 
host Unlimited length write transactions arc also implemented by the external interface 
25 controller 238. Farther, the external interface controller 23B optionally also performs 
prefetching of data. 

The construction of the external interface controller 238 includes a memory 
management unit which provides virtual to physical address mapping of host memory 
accesses for all of the co-processor's internal modules. This mapping is completely 
30 transparent to the module requesting the access. When the external interface controller 
238 receives a request for host memory access, it initiates a memory management unit 
operation to translate the requested address. Where the memory management unit is 
unable to translate the address, in some cases this results in one or more PCI Bus 
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transaction to complete the address translation. This means that the memory 
management unit itself can be another source of transaction requests on the PCI Bus. If 
a requested burst from the input interface switch 252 or results organizer 249 crosses the 
boundary of a virtual page, the external interface controller 238 automatically generates a 
5 memory management unit operation to correctly map all virtual addresses. 

The memory management unit (MMU) (915 of Fig. 150) is based around a 16 
entry translation look aside buffer (TLB). The TLB acts as a cache of virtual to physical 
address mappings. The following operations aie possible on the TLB: 

1) Compare: A virtual address is presented, and the TLB returns either the 
10 corresponding physical address, or a TLB miss signal (if no valid entry matches the 

address). 

2) Replace: A new virtual-to-physical mapping is written into the TLB, 
replacing an existing entry or an invalid entry. 

3) Invalidate: A virtual address is presented; if it matches a TLB entry, that 
15 entry is marked invalid. 

4) Invalidate All. All TLB entries arc marked invalid* 

5) Read: A TLB entry's virtual or physical address is read, based on a four bit 
address. Used for testing only. 

6) Write: A TLB entry's virtual and physical address is written, based on a 
20 four bit address. 

Entries within the TLB have the format shown in Fig. 151. Each valid entry 
consists of a 20-bit virtual address 670, a 20-bit physical address 671, and a flag which 
indicates whether the corresponding physical page is writable. The entries allow for 
page sizes as small as 4kB. A register in the MMU can be used to mask off up to 10 bits 
25 of the addresses used in the comparison. This allows the TLB to support pages up to 
4MB. As there is only one mask register, all TLB entries refer to pages of the same size. 

The TLB uses a "least-recently-uscd" (LRU) replacement algorithm. A new 
entry is written over the entry which has the longest elapsed time since it was last written 
or matched in a comparison operation. This applies only if there are no invalid entries; if 
30 these exist, they arc written to before any valid entries are overwritten* 

Fig. 152 shows the flow of a successful TLB compare operation. The incoming 
virtual address 880 is divided into 3 parts 881 - 883. The lower 12 bits 881 are always 
part of the offset inside a page and so are passed directly on to the corresponding physical 
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address bits 885. The next 10 bits 882 are either part of the offset, or part of the page 
number, depending on the page size, as set by the mask bits. A zero in the mask register 
887 indicates that the bit is part of the page offset, and should not be used for TLB 
comparisons. The 10 address bits are logically "ANDED" with the 10 mask bits to give 
th, lower 10 bits of (he virtual page number 839 f TLB lookups. The upper 10 bits 
883 of the virtual address are used directly as the upper 10 bits of the virtual page number 
889. 

The 20-bit virtual page number thus generated is dri\ en into the TLB. if it 
matches one of the entries, the TLB returns the corresponding physical page number 872, 
and the number of the matched location. The physical address 873 is generated from the 
physical page number using the mask register 887 again. The top 10 bits of physical 
page number 872 are used directly as Ihe top 10 bits of the physical address 873. TTie 
next 10 bits of physical address 872 arc chosen 87S from either the physical page number 
(if the corresponding mask bit is 1), or the virtual address (if the mask bit is 0). TTie 
lower 12 bits 885 of physical address come directly from the virtual address. 

Finally, following a match, the LRU buffer 876 is updated to reflect the use of 
the matched address. 

A TLB miss occurs when the input interface switch 252 or the results organizer 
249 requests an access to a virtual address which is not in the TLB 872. In this case, the 
MMU must fetch the required virtual-to-physical translation from the page table in host 
memory 203 and write it into the TLB before proceeding with the requested access. 

The page tabic is a hash table in the hosls main memory. Each page table entry 
consists of two 32-bit words, with the format shown in Fig* 153. The second word 
comprizes the upper 20 bits for the physical address and the lower 12 bits arc reserved. 
The upper 20 bits ef the corresponding virtual address are provided in the first word 
The lower 12 bits include a valid (V) bit and writable (W) or a "read-only" bit, with the 
remaining 10 bits being reserved. 

The page table entry contains essentially the same information as the TLB entry. 
Further flags in the page table are reserved. The page table itself may be, and typically is, 
distributed over multiple pages in main memory 203, which in general are contiguous in 
virtual space but not physical space. 

The MMU contains a set of 16 page table pointers, setup by software, each of 
which is a 20-bit pointer to a 4kB memory region containing part of the page table. This 
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means the co-processor 224 supports a page table 64kB in size, which holds 8k page 
mappings. For systems with a 4kB page size, this means a maximum of 32MB of 
mapped virtual address space. Preferably, the page table pointers always reference a 4kB 
memory region, regardless of the page size used in the TLB. 
5 The operation of the MMU following a TLB miss is shown 690 in Fig. 154, as 

follows: 

1. Execute the hash function 892 on the virtual page number 891 that 
missed in the TLB, to produce a 13-bit index into the page table. 

2. Use the top 4 bits 894 of the page table index 894, 896 to select a page 
10 table pointer 895. 

3. Generate the physical address 890 of the required page table entry, by 
concatenating the 20-bit page table pointer 895 with the lower 9 bits of the page table 
index 896, setting the bottom 3 bits to 000 (since page table entries occupy 8 bytes in host 
memory), 

15 4. Read 8 bytes from host memory, starting at the page table entry physical 

address 898. 

5. When the 8-byte page table entry 900 is returned over the PCI bus, the 
virtual page number is compared to the original virtual page number that caused the TLB 
miss, provided that the VALID bit is set to 1. If it does not match, the next page table 

20 entry is fetched (incrementing the physical address by 8 bytes) using the process described 
above. This continues until a page table entry with a matching virtual page number is 
found, or an invalid page table entry is found. If an invalid page table entry is found, a 
page fault error is signalled and processing stops* 

6. When a page table entry with a matching virtual page number is found, 
25 the complete entry is written into the TLB using the replace operation. The new entry is 

placed in the TLB location pointed to by the LRU buffer 876. 

The TLB compare operation is then retried, and will succeed, and the originally 
requested host memory access can proceed. The LRU buffer 876 is updated when the 
new entry is written into the TLB. 
30 The hash function 892 implemented in the EIC 238 uses the following equation 

on the 20 bits of virtual page number (vpn): 

index = ((vpn»Si) XOR (vpn»S2)XOR(vpn»S3)) & Oxlfff; 
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j where Sj, S2 and S3 are independently programmable shift amounts (positive or 

1 negative), each of which can take on four values. 

If the linear search through the page table crosses a 4kB boundary, the MMU 
automatically selects the next page table pointer to continue the search at the correct 
5 physical memory location. This heL;k--. wrapping around from the end of the page 
table to the start. The page table always contains at least one invalid (null) entry, so that 
these:..:b ;'-vays terrninpr.-s- 

Whenever the software replaces a page in host memory, it must add a page table 
entry for the new virtual page, and remove the entry corresponding to the page that has 
10 been replaced. It must also make sure that the old page table entry is not cached in the 
TLB on the co-processor 224. This is achieved by performing a TLB invalidation cycle 
in the MMU. 

I An invalidation cycle is performed via a register write to the MMU, specifying 

the virtual page number to be invalidated, along with a bit that causes the invalidation 

15 operation to be done. This register write may be performed directly by the software, or 
via an instruction interpreted by the Instruction Decoder. An invalidation operation is 
performed on the TLB for the supplied virtual page number. If it matches a TLB entry, 
that entry is marked invalid, and the LRU table updated so that the invalidated location is 
used for the next replace operation. 

20 A pending invalidate operation has priority over any pending TLB compares. 

When the invalidate operation has completed, the MMU clears the invalidate bit, to signal 
that it can process another invalidation. 

If the MMU fails to find a valid page table entry fbr a requested virtual address, 
this is termed a page fault. Hie MMU signals an error, and stores the virtual address that 

23 caused the fault hi a software accessible register. The MMU goes to an idle state and 
waits until this error is cleared. When the interrupt is cleared, the MMU resumes from 
the next requested transaction. 

A page fault is also signalled if a write operation is attempted to a page that is 
(not marked writable) marked read only. 

30 The external interface controller (E1C) 238 can service transaction requests from 

the input interface switch 252 and the result organizer 249 that are addressed to the 
Generic bus. Each of the requesting modules indicates whether the current request is for 
the Generic Bus or the PCI bus. Apart from using common buses to communicate with 
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the input interface switch 252 and the results organizer 249, the EICs operation for 
Generic bus requests is entirely separate from its operation for PQ requests. The HC 
238 can also service CBus transaction types that address the Generic bus space directly. 

Fig. 150 shows the structure of the external interface controller 238. The IBus 
5 requests pass through a multiplexer 910, which directs the requests to the appropriate 
internal module, based on the destination of the request (PCI or Generic Bus). Requests 
to the Generic bus pass on to the generic bus controller 911, which also has RBus and 
CBus interfaces. Generic bus and PCI bus requests on the RBus use different control 
signals, so no multiplexer is required on this bus. 

10 EBus requests directed to the PCI bus are handled by an IBus Driver (IBD) 912. 

Similarly, an RBus Receiver (RBR) 914 handles the RBus requests to PCI. Each of the 
IBD 912 and RBR 914 drive virtual addresses to the memory management unit (MMU) 
915, which provides physical addresses in return. The IBD, RBR and MMU can each 
request PCI transactions, which are generated and controlled by the PQ master mode 

15 controller (PMC) 917. Hie IBD and the MMU request only PCI read transactions, while 
the RBR requests only PCI write transactions. 

A separate PCI Target Mode Controller (PTC) 918 handles all PCI transactions 
addressed to the co-processor as a target. This drives CBus master mode signals to the 
instruction controller, allowing it to access all other modules. The PTC passes returned 

20 CBus data to be driven to the PQ bus via the PMC, so that control of the PCI data bus 
pins comes from a single source. 

CBus transactions addressed to EIC registers and module memory are dealt with 
by a standard CBus interface 7. All submodulcs receive some bits from control registers, 
and return some bits to status registers, which are located inside the standard CBus 

25 interface. 

Parity generation and checking for PCI bus transactions is handled by the parity 
generate and check (PGC) module 921, which operates under the control of the PMC and 
PTC. Generated parity is driven onto the PQ bus, as are parity error signals. The 
results of parity checking are also sent to the configuration registers section of the PTC 
30 for error reporting. 

Fig. 155 illustrates the structure of the IBus driver 912 of Fig, 150. Incoming 
IBus address and control signals are latched 930 at the start of a cycle. An or-gate 931 
detects the start of the cycle and generates a start signal to control logic 932. The top 
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address bits of the latch 930, which form the virtual page number, are loaded into a 
counter 935. The virtual page number is passed to the MMU 915 (Fig. 150) which 
returns a physical page number which is latched 936, 

The physical page number and the lower virtual address bits are recornbined 
5 according to the mask 937 and form the :vid:cs.<: 938 for PCI requ . is to the PMC IV" 
(Fig.102). The burst count for the cycle is also loaded into a counter 939. Prefetch 
operations use another counter 941 and an address latch and compare circin* 94-3. 

Data returned from the PMC is loaded into a FIFO 944, along with a marker 
which indicates whether the data is part of a prefetch. As data becomes available at the 
10 front of the FIFO 944, it is clocked out by the read logic via synchronization latches 
945,946. The read logic 946 also generates the EBus acknowledge signal. 

A central control block 932, including state machines, controls the sequencing of 
all of the address and data elements, and the interface to the PMC. 

The virtual page number counter 935 is loaded at the start of an IBus transaction 
15 with the page number bits from the IBus address. The top 10 bit of this 20-bit counter 
always come from the incoming address. For the lower 10 bits, each bit is loaded from 
the incoming address if the corresponding mask bit 937 is set to 1; otherwize, the counter 
bit is set to 1. The 20-bit value is forwarded to the MMU interface. 

In normal operation the virtual page number is not used after the initial address 
20 translation. However, if the IBD detects that the burst has crossed a page boundary, the 
virtual page counter is incremented, and another translation is performed. Since the low 
order bits that are not part of the virtual page number are set to 1 when the counter is 
loaded, a simple increment on the entire 20-bit value always causes the actual page 
number field to increment Hie mask bits 937 are used again after an increment to set up 
25 the counter for any subsequent increments. 

The physical address is latched 936 whenever the MMU returns a valid physical 
page number after translation. The mask bits are used to correctly combine the returned 
physical page number with the original virtual address bits. 

The physical address counter 938 is loaded from the physical address latch 936. 
30 It is incremented each time a word is returned from the PMC. The count is monitored as 
it increments, to determine whether the transaction is about to cross a page boundary. 
The mask bits are used to determine which bits of the counter should be used for the 
comparison. When the counter detects that there are two or less words remaining in the 
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page, it signals the control logic 932, which the terminates the current PCI request after 
two more data transfers, and requests a new address translation if required. The counter 
is reloaded after the new address translation, and PCI requests resumed. 

The burst counter 939 is a 6-bit down counter which is loaded with the TBus 
5 burst value at the beginning of a transaction. It is decremented every time a word is 
returned from the PMC. When the counter value is two or less, it signals to the control 
logic 932, which can then terminate the PCI transaction correctly with two more data 
transfers (unless prefetching is enabled). 

The prefetch address register 943 is loaded with the physical address of the first 

10 word of any prefetch. When the subsequent IBus transaction starts, and the prefetch 
counter indicates that at least one word was successfully prefetched, the first physical 
address of the transaction is compared to the value in the prefetch address latch. If it 
matched, the prefetch data is used to satisfy the IBus transaction, and any PCI transaction 
requests start at the address after the last prefetched word. 

15 The prefetch counter 941 is a four bit counteT which is incremented whenever a 

word is returned by the PMC during a prefetch operation, up to a maximum count equal to 
the depth of the input FIFO. When the subsequent IBus transaction matches the prefetch 
address, the prefetch count is added to the address counter, and subtracted from the burst 
counter, so that PCI requests can start at the required location. Alternatively, if the IBus 

20 transaction only requires some of the prefetched data, the requested burst length is 
subtracted from the prefetch count, and added to the latched prefetch address, and the 
remaining prefetch data is retained to satisfy further requests. 

The Data FIFO 944 is a 8 word by 33 bit asynchronous fall through FIFO, Data 
from the PMC is written into the FIFO, along with a bit indicating whether the data is part 

25 of a prefetch. Data from the front of the FIFO is read out and driven onto the IBus as 
soon as it becomes available. The logic that generates the data read signals operates 
synchronously to elk, and generates the IBus acknowledge output If the transaction is to 
be satisfied using prefetched data, signals from the control logic tell the read logic how 
many words of prefetched data should be read out of the FIFO. 

30 Fig. 156 illustrates the structure of the RBus Receiver 914 of Fig. 150. Control 

is split between two state machines 950, 951. The Write state machine 951 controls the 
interface to the RBus. The input address 752 is latched at the start of an RBus burst. 
Each data word of the burst is written in a FIFO 754, along with its byte enables. If the 
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FIFO 954 become full r-ready is deasserted by the write logic 951 to prevent the results 
organiser from attempting to write any more words. 

The write logic 951 notifies the main slate machine 950 of the start of an RBus 
burst via a resynchronized start signal to prevent the results organizer from trying to write 
5 any more woiti>,. The to r * address bits, which form the virtual page number, are loaded 
into a counter 957. The virtual page number is passed to the MMU, which returns a 
physical page number 958. The prysicn* page number and the lower bits of the virtual 
address are recombined according to the mask, and loaded into a counter 960, to provide 
the address for PCI requests to the PMC Data and byte enables for each word of the 

10 PCI request are clocked out of the FIFO 954 by the main control logic 950, which also 
handles ail PMCM interface control signals. Hie main state machine indicates that it is 
active via a busy signal, which is resynchronized and returned to the write state machine. 

The write state .machine 951 detects the end of an RBus burst using r-final. It 
stops loading data into the FIFO 954, and signals the main state machine that the RBus 

15 burst has finished. The main state machine continues the PCI requests until the Data 
FIFO has been emptied. It then deasserts busy, allowing the write state machine to start 
the next RBus burst 

Returning to Fig. 150, the memory management unit 915 is responsible for 
translating virtual page numbers into physical page numbers for the IBus driver (IBD) 912 

20 and the RBus receiver (IBR) 914. Turning to Fig. 157, there is illustrated the memory 
management unit in further detail. A 16 entry translation lookaside buffer (TLB) 970 
takes its inputs from, and drives its outputs to, the TLB address logic 971. The TLB 
control logic 972, which contains a state machine, receives a request, buffered in the TLB 
address logic, from the RBR or IBD. It selects the source of the inputs, and selects the 

25 operation to be performed by the TLB. Valid TLB operations are compare, invalidate, 
invalidate all, write and read. Sources of TLB input addresses are the IBD and RBR 
interfaces (for compare operations), the page table entry buffer 974 (for TLB miss 
services) or registers within the TLB address logic. The TLB returns the status of each 
operation to the TLB control logic. Physical page numbers from successful compare 

30 operations are driven back to the IBD and RBR. The TLB maintains a record of its least 
recently used (LRU) location, which is available to the TLB address logic for use as a 
location for write operations. 
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When a compare operations fails, the TLB control logic 972 signals the page 
table access control logic 976 to start a PCI request. The page table address generator 
977 generates the PCI address based on the virtual page number, using its internal page 
table pointer registers. Data returned from the PCI request is latched in the page table 
5 entry buffer 974. When a page table entry that matches the required virtual address is 
found, the physical page number is driven to the TLB address logic 977 and the page table 
access control logic 976 signals that the page table access is complete. The TLB control 
logic 972 then writes the new entry into the TLB, and retries the compare operation. 

Register signals to and from the SCI are resyndironized 980 in both directions. 
10 The signals go to and from all other submodules. A module memory interface 981 
decodes access from the Standard CBus Interface to the TLB and page table pointer 
memory elements. TLB access are read only, and use the TLB control logic to obtain the 
data. The page tabic pointers are read/write, and are accessed directly by the module 
memory interlace* These paths also contain synchronization circuits. 
15 3.18.11 Peripheral Interface Controller 

Turning now to Fig. 158, there is illustrated one form of peripheral interface 
controller (PIC) 237 of Fig. 2 in more detail. The PIC 237 works in one of a number of 
modes to transfer data to or from an external peripheral device. The basic modes are: 

1) Video output mode. In this mode, data is transferred to a peripheral under 
20 the control of an external video clock and clock/data enables. The PIC 237 drives output 

clock and clock enable signs with the required timing with respect to the output data. 

2) Yideo input mode. In this mode, data is transferred from a peripheral under 
the control of an external video dock and data enable. 

3) Centronics mode. This mode transfers data to and from the peripheral 
25 according to the standard protocol defined in IEEE 1284 standard. 

The PIC 237 decouples the protocol of the external interface from the internal 
data sources ur destination in accordance with requirements. Internal data sources write 
data into a single stream of output data, which is then transferred to the external 
peripheral according to die selected mode. Similarly, all data from an external 
30 peripheral is written into a single input data stream, which is available to satisfy a 
requested transaction to either of the possible internal data destinations. 



There are three possible sources of output data: the LMC 236 (which uses the 
ABus), the RO 249 (which uses the RBus), and the global CBus. The PIC 237 responds 
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to transactions from these data sources one at a time - a complete transaction is completed 
from one source before another source is considered. In general, only one source of data 
should be active at any time. If more than one source is active, they are served with the 
following priority - CBus, then ABus, then RBus. 

usual, the modi, is c '.crates under the control of the standard CBm interf- ~z 
990 which includes the PIC's internal registers. 

Further, a OB us data interface 992 is provider! for accessing an 3 controlling 
peripheral devices via the co-processor 224. An ABus interface 991 is also provided for 
handling memory interactions with the local memory controller. Both the ABus 
interface 991 and CBus data interface 992 in addition to the result orginizer 249 send data 
to an output data path 993 which includes a byte - wide FIFO. Access to the output data 
path is controlled by an arbiter which keeps track of which source has priority or 
ownership of the output stream. The output data path in turn interfaces with a video 
output controller 994 and Centronics control 997 depending on which of these is enabled. 
Each of the modules 994, 997 reads one byte at a time from the output data path's internal 
FIFO. The Centronics controller 997 implements the Centronics data interfacing standard 
for controlling peripheral devices. The video output controller includes logic to control 
output pads according to the desired video output protocols. Similarly, a video input 
controller 998 includes logic to control any implemented video input standard. The 
video input controller 998 outputs to an input data path unit 999 which again comprizes a 
byte wide input FIFO with data being written into the FIFO asynchronously, one byte at a 
time, by either the video input controller 998 or Centronics controller 997. 

A data timer 996 contains various counters utilized to monitor the current state of 
FIFO's within output data paths 993 and input data path 999. 

It can be seen from the foregoing that the co-processor can be utilized to execute 
dual streams of instructions for the creation of multiple images or multiple portions of a 
single image simultaneously. Hence, a primary instruction stream can be utilized to 
derive an output image for a current page while a secondary instruction stream can be 
utilized, during those times when the primary instruction stream is idle, to begin the 
rendering of a subsequent page. Hence, in a standard mode of operation, the image for a 
current page is rendered and then compressed utilising the JFEG coder 241. When it is 
required to print out the image, the co-processor 241 decompresses the JPEG encoded 
image, again utilising the JFEG coder 241. During those idle times when no further 
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portions of the JPEG decoded image are required by an output device, instructions can be 
carried out for the compositing of a subsequent page or band. This process generally 
accelerates the rate at which images are produced due to the overlap operating of the co- 
processor. In particular, the co-processor 224 can be utilized to substantial benefit in the 

5 speeding up of image processing operations for printing out by a printer attached to the 
co-processor such that rendering speeds will be substantially increased. 

It will be evident from the foregoing that discussion of the preferred embodiment 
refers to only one form of implementation of the invention and modifications, obvious to 
those skilled in the art, can be made thereto without departing from the scope of the 

10 invention. 
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Appendix A: 

The Coprocessor Microprogramming 

This section details the actions performed internally in ihc coprocessor each lime a new 
instruction is executed. AH self -configuration earned out by the coprocessor during instruc- 
ion execution is performed via internal register reaJi and wriies, consequently the coproc- 
essor can be fully nracmprognimmed via the External Cbos Interface or by the host via tie 
PCI bus Interface. Note thai microprogramming via the host interlace will typically be very 
difficult due to the problems of synchronising the nasi. 

: n tills L.v.i » :-us.*.Tv±d that the rcz:.-iT is ; V* cop; cesser's: 

1. execution model, 

2. instruction set and its coding, 

3. register set, 

4. and interna] structure,. 



A.1 General Notes 
A.t .1 General CoprocessorSetup Principles 

For all instructions other than Control Instructions and Local DMA Instructions, the data 
flow tluough the coprocessor is fundamentally under the control of the Pixel Organiser. The 
Pixel Organiser is responsible for fetching the primary input stream of data, for counting this 
flow of data and for determining when the final piece of data has been fetched. Generally 
speaking, the other modules within the coprocessor simply respond to data sent to them. 

A.1. 2 Module Configuration Order 

Not all modules arc set up for every instruction Some modules are never configured in the 
course of instruction decoding. Tic order thai modules are configured Is always (be same 

and is PCX DCC. OOB, OOC, MDP, JC. RO, PTC 

A.1 .3 Setting miscellaneous registers 

If an instruction is encoded to include the setting of a register value, then that register is set 
during microprogrammi ng in sequence defined by the following rules: 

1. If the register to be set is in a module thai would otherwise not have any registers set* 
then that register is set prior to any other register seal ngs 

2. If the register to be set is in a module that will have other registers sct» then the extra reg- 
ister is set after other registers tut immediately prior to that module's _cfg register. 

A. 1.4 Inconsistent Instruction Operand Coding 

Many instructions have implied data types for operands and results, with other data types 
producing meaningless output. For each of the operand*, the coprocessor determines the 
intended format using the following procedure: 

I. if Ok internal format of the operand is specified to be one of pixels, packed bytes or 
unpacked bytes, then the corresponding Operand Organiser will be set tip to reflect this. 
The Data Cache Controller will not be configured and will thus continue to operate in 
^normal** mode. 

Z if die internal format of the operand is specified to be "other then the coprocessor will 
derive the format of the operand from the instruction. For Operand B and Operand C, this 
is straight forward For Operand A there is no naturally implied "other" format, and the 
coprocessor's behaviour is undefined. The cont spondi ng Operand Organiser will be left 
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in bypass mode, and the Data Cache Controller will be set up to manage the operand data 
in the derived format. 

Microproaiaimnjns is reasonably orthogonal splii between the various modules. 

5 Pseudocode syntax 

■ The sequence of operations is given by the numbers in the left margin. 

* Register names are in helvetica bold. 

* Register fields are denoted regis ter.iield 

* l ( D = Instruction word and Data word respectively of instruction currently ban • decoded 

* A, B and C = A Operand word, B Operand word and C Operand word o/instn *-'Jon cur- 
rently being decoded 

- A_descriptor, B_descriptor and (^descriptor = A descriptor. B descriptor and C 
descriptor of Data Wuid of instruction currently being decoded 

* R = Result word of instniction currently bei ng decoded 

* "X : = X concatenated with Y 

* "@)T s the coprocessor register number X 

* "cbus ( X)" - do CBus operation X 

- "*cbus ( x j " « data received frorn CBus read operation X 

* ***3T = virtual memory locarion X 

* = unknown value, yet to be determined 

* "set" refers to the retting of the Data Maripulauon Registers. 



Compositing Operators 

Notes 

1 . Major opcode - OxC and OxJD 

2. Opacity is considered to be the byte of highest address U. is the most significant byte. 

3. Either the accumulator or the operand my be pre-mulriplied 

4. Result may be un-pre-multiplied 

5. Instruction length is defined in terms of the number of input pixels 

Pixel Organ her: 
if LR-0 then 

pojen <- oxoooo rUength 
endif 

if A_descriptor.S=0 then 

po_d mr set 
endif 

po_said <- R 

po_c(g.inode <- sequential C et going 

po_cfg.dst <- mdp 

DCC in *norroal32* mode <de£ault) 



Operand organiser 3: 
if BjttescriptorJf != other then 
oobjen <- pojen 
if B_desoriptor.6 - 0 then 

oob_dtnr <- set 
endif 

oob_said <- A 

if B_d©ceriptor.what - tile 

oobjllle <- 6 
endiC 



not blend 



special for compositing 
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9- oob_cfg.operate <■ operate 

endif 

Operand Organiser C 

There is no implied other data :ype for this 

bo used. to specify any biL offset in a bit map_ attenuation 
if C_descriptor.if ! = other zfcer. 
"0. coc_lu:r <- po_Jen 

- - set up the ooc^dmr 

if C_descrttor_S ~ 0 then 
11 - oocjamr <- set -- 'ncludinc bo 

en3il' 

12. ooc_sald <- c 

13. ooc_ctg.operate <:- operate 
endif 

Main Data Path: 
if BjdescrlptorJr - other then 

14. mdpjen <- pojen 

15. mdp_bm <- B 

16. mdp_bl <- A 
end if 

17. mdpjcfgJnstructlon <- I. opcode 
mdp_cf gjjlendgen < - < Bjdoscr IplorJf 
mdp_efg.cfocode <- 1 
mdp_ofg. operate <- 1 

Result Organiser: 
if R_descritpor.S = 0 then 

18. rojdrnr <- set 
endif 

19. ro_sa <- R 

20- ro__cfgjnode <- sequential 



blend 



= other) 



A.3 Colour Space Conversion 

Notes 

K Input space is always three dimensions. By default this is the three least significant chan- 
nels of the pixel - the opacity is the one thrown away. 

2. The colour table format is one of two types: containing either one output channel or four 
output channels. 

Pixel Organiser: 

operand A only makes sense as source pixels, anything else 
— is probably- wrong 
if I.R - 0 then 

1. po_Un <- OxOOOOtUen 
endif 

if A_descriptor.S = o then 

2. po_dmr <- set 
endif 

3. po said <- A 

4. po_muv <- C I * P tables 

5. po_cfg.modc <- esc 
pq_cfg.dst <- mdp 
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' - DCC setup : 

- should be or her 
if B_descriptor.if = other then 
6- dcc_addr <- B 

7. dcc_cig2.cache_mi33_inst B_descrjp1or.C 

if J.M = 0 then single output channel 

dec cfg2.mode <- single channel CSC 

else 

dec„ctg2.mode <- multi channel CSC 
endif 
endif 

Operand Organiser B; 

Operand B should be other, anyrhing else in probably 

wrong bat do it anyway 
if B_descriptor.if •= other then 
6. oobjen <- pojen 

if B_descn*ptOr.S 0 then 
9- oob_dmr <- set 

endif 

10. oob^sald <- B 

11. oob__cfg.opcrate <- operate 
endif 

Operand Organiser Cr 

Operand C should be other, anything else is probably 
wrong but do it anyway 
if C_descriptor.if !- other then 

12. ooc_!en <- pojen 

if Cjdescrlptor.S <= 0 then 

13. ooc_dmr <- set 
endif 

14. ooc.ssld <- c 

15. ooo_cfg.operate <- operate 
endif 

Main Data Path: 
1$. mdpjcfgJnstructfon <- I. opcode 
mdp_cfgjd©o&de <- 1 
mdp_cfgjOperale i 

Result organiser: 
if Rjdescrlptor.S « 0 then 
17- rojdror <- set 
endif 

18. rojaa <- R 

19. ro_cfg.mode <- sequential 



A.4 JPEG Instructions 



Notes 

!. Opcode - 0x2 

2. Operand C may te a register to set 

3. Oct ions Are manv: 
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in 

- subsamp!iii£ or not 

* CiMcrin^ ornoi 

* 1 , 3. or 4 scan 

niese inimicuoiis rely on several registers lo be set up prior to ilie irsiruction bcin° 
cuicd. 



P '■compress; 



Notes 

l. 7heioL;owi;; 0 re^erj. . jJd b& set prior t o ibis i..:-.,i/r.uo.': b:..^ seemed: 
- rojdr; otsput image dimensions register 

• ro_cuL output cut register 

• rojml: output limit register 



--if there is a register set targeted at anything other than 

the OC t po, RO or DCC 
if (D.n - z) and < C_descr I ptor.top nibble != 0x5,0x6,0x9 or OxB) then 

§C_descrip1or <- C 
endif 



Pixel Organiser: 

Operand A is the source compressed data, and should be a 
--. byte stream 
if LR = O then 

pojen <- 0x0000: Lien 
endif 

if A_descr1ptor_S = o then 

po_d<nx <- set 
endif 

po^said <- A 

--if there is a register set targeted at PO 
if (O.R «= 1) and { O_descriptor.topnibbte = 0x6) then 

eC_descrfctor <- C 
endif 

po_cfg.mode <- sequential 
po_cfg.dst <- jc 



DCC setup: 
if B_de*criptorJf « other then 
decjaddr <- B 

if there is a register set targeted, at DCC, 
it (M - iy *d<? (<^ie»crtptorio^Rtt^ - 0x5) then 

4C__de9Criptor <- G 
endix 

dce_clgZC <- B u descriptor.C 
dcc_clg2jnode o JPEG decode 
end if 



— operand Organiser fl: 

-- Operand B should be other, 

wrong, but do it anyway 
if 8_descriptor.if |- other then 

oobjen <- pojen 

if 6_descrlptor.S « 0 then 
oob_dmr <- set 

endif 



anything else is probably 
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11. 

12. 



ooU._said <- B 
oob_cfg.operate < - operate 
endif 



Operand Organiser C: 
-- Operand C may be a register tc sot. if it isn't it should 

be other, anything clso is prcb=.bly wrong, but do it anywa 
if <R.D = 0) and ( C_descriptoMf !- c:her) then 

13. ooc_len <- po_Jen 

if C descriplor.S = 0 then 

14. ooc_dmr <- set 
endit 

15. ooc_said <- C 
ooc_cfgjoperale < - operate 

endif 



- - JC setup: 

if there is a register set targeted at JC 
if (D.R - 1) and ( C_dcscriptor.topnibble « 0x9) then 

16. @C_descrlptor <- C 
endif 

17. jc_cfgJnst ruction <- I. opcode 
jejcfg.decode <-. 1 
je.cfgjoperote <- 1 

Result organiser: 
if R^descriptor.S = O then 

18. ro jdmr <- set 
endif 

if there is a register write targeted at RO 
10. ro_sa <- Fl 

if (D.R - 1) and <C_descrlptor.topnabb!o = OxB> then 

20. eC_descrlptor <- C 
endif 

21. ro^cfgjTjodo <- jpeg 
ro_clg.chan <- I.M:L4 
ro_cfg.upwimple <- I.S 
ro_cfgxu1 <- LC 
rojclgJImh <- IT 



A. 4.2 Compression 

Notes 

1 . Hie following registers should tc set prior to this instruction being executed: 

• pojdr: output image dlmenst ons register 

• Jp_rml: restart marker interval 

• ro_cut: output cut register 

" ro_tmt; output limit register 



if there is a reqister set targsted at anything other than 
the JC, PO, ro or Dec 
if (D.R = 1) and (C_descrIptoMopnlbb!e <= 0x5.0x6,0x9 or 0xB| then 
0. ec_descriptar <- C 

endif 

Pixel Organiser: i . 
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Operand A is the source cojr.presseci data, <tnd should be a 

— - byte st rcatr. 
if LR - 0 then 

1- pojen <- OxOOO0:l.len 

endif 

if A_descriptor.S = 0 then 

2. o_dmr <- set 
e:-i „iLf 

3. po_said <- A 

--if there is a register set targeted at PO 
1' 1 D,R 1) and ( C_dcoorjptor.topnibbi-? - 0x6"! then 

4. ^0_<Jescriplor <- C 
endif 

5. po_cfg.mode <- jpeg 
po_cfg.dst <- jc 

po_cfg.SS <- I.S subsampling 
po_cfgxhan <- LM:L4 
po__cfg.F <- IF 

— DCC setup: 

if B_descrlptor.lf = other then 

6. dcc_addr <- B 

— if there is a register set targeted at DCC 

if (D.R = 1) and (C_desoriptor.topn(bble « 0>c5) then 

7. £C_descriptor <- c 
endif 

8. dccjcfg2^ache„rnJssJnst <- B_descriptor.C 
dcc_cfg2.rnode <- jpeg encode 

end if 



Operand Organiser B; 
-- Operand B should be other, anything else is probably 

wrong, but do it anyway 
if B.rfescriptorjf != other then 

9. oobjen <- pojen 

if B_descriptoi\S - 0 then 

10. oobjdmr <- set 
endir 

11. oobjsaid <- B 

12. oab_cfgjoperate <- operate 
endif 



Operand Organiser ,C; , 
-~ operand c may be a register to set. it \!t isn't Lt should 
— be other, anything else is probably wrong, but do it 

- - Anyway - 

if {RJD - 0) and ( CjdeacrfptotJt I- other) then 

13. joogjten <- po_len 

if C_de*cr1ptor.S * 0 then 

14. ooc_dmr <- set 
endif 

15. ooc said <- C 

16. ooc_cfg.operate <- operate 
endif 



JPEG Coder: 

-- if there is a register set targeted at JC 
if {D.R « 1) and ( C_de3crlptor.topnlbble - 0x9 } then 
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17. <?C_descriptor < 
cr.dif 

18. Jc_ctg. Instruction < 
jc_cfg,d&uude <~ 1 
jc_cfg.operato <- 1 

Result creamier . 
if R_descriptor.S = 0 then 

19. ro_drnr <- set 
endif 

20. ro_sa <- R 

--if there Is a register write targeted at KO 
if (D.R - 1) and CC.descriptor.top nibble = OxB) then 

21. eC_descriptor <- C 
endif 

22. ro cfg.mode <- sequential 
ro_cfgxut <- I.C 
ro_cfgjlmlt <- IT 

A. 5 Data Coding 

Notes 

1 . All data coding operations, wbetter compressing or decompressing are dealt with in the 
same way. Setting up for these operations looks very similar to that for JPEG. 

2. Possible encoding operations are: 

• Huffman encode 

* predictive encode 

3. Possible decoding operations ax: 

- fast huffman decode 

♦ slow hufrman decode 

* packtritt decoding (version A) 

• packbits decoding (version B) 

* predictive decode 

.4. Operand C may be a register to set 

2, The following itgisieis may be required to be set prior lo this mstniction being executed: 

- ro_cut output cut register 

• ro jmt output limit regi iter 

--if there is a register act targeted at anything other than 

the JC, ro, HO ot DCC 
if (DJ* - 1) and (C.descriptor.topnlbble l-» 0x5,0x6,0x9 or OxB) then 

0. ec_<fescrlptor <- C 
endif 

-- Pixel Organ iser: 

--'operand A is the source data 

it in « 0 then 

1. pojan <- OxOOOO:l.lon 
endif 

if A^deacriptor.S - 0 then 

2. pojdmr <- set 
endif 

3. po_sald <- A 

-- if there is a register set targeted at PO 
if (O.R - i) and { 0_descriptorJopnJbble - 0x6) then 

4. eC_descr1ptor <- C # 

■ -i ! " ■ *v 'v v 



- c 

- I.cpcodc 
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end L t: 

5. po_cfg.mode <- sequential 
po_cfg.dst <- jc 

- - DCC setup ; 

if 8_de3criptor.ff - other then 

6. dcc_j3d. <- 0 

-- if Chere is a register set t^rgeied at DCC 
if <D.R - 1) and (C_dcscripior.t op nibble = OxS) then 

7. £C_d«scriptor <- C 
end i f 

8. dcc_ctg2.c3che_miss_lnst <- B_descrlptor.C 
dcc_ctg2jnode <- en/de coding 

else 

if there is a register set targeted at DCC 
if <D.R - 1) and (C_descrip to r.top nibble = 0x5) then 

eQjdescripior <- C 
end if 

dcc_cfg2.eache_miss_inst <- B_deserlptor.C 
endif 



- Operand Organiser B: 

Operand B should be other, anything else is probably 
vrong, but do it anyway 
if B_descriptor,if ! = other then 

9. oobjen <- pojen 

if B_descriptor.S - 0 then 

10. oob_dmr <- set 
endif 

1 1 . oob _said < - B 

12. oobcfg.oporat© <- operate 
endif 

Operand Organiser C: 
-- Operand C may t»e a register to set. If it isn't it sftould 
--be other* anything else is probably wrong, but do it 

- - anyway 

if (R.D -* 0) and f C_de$crip!orJf I- other) then 

13. oocjen <- pa J en 

if O^descrfptor.S - 0 then 
"14. ooc dmr <— »ei 

endif 

15, OOC_G3ld <- C 

16, ooo_ofg.op»r*ts <- tope rata 
endif 

JPEG Coder: 

- - • if there is a register set targeted at JC 
if ° 1) and { C_descriptor.tcpnIbbU * 0x9) then 

17, GC_d@seriptor <- C 
endif 

18, Jc_cfgJnstnjctk)n <- I. opcode 
Jc_cfa.decode <- l 

jc_cfg. ope rate <- 1 

Result Organiser: 
if R_descriptor.S » 0 then 

19, ro_dmr <- set 
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endif 

20. ra_sa <- R 

if there is a register write targeted at RO 
it ( D.H - 1) and (C_de3Cripto Mop nibble = 0x6) then 

21. *C_descriptor <- C 
endif 

22. ro^cfg.ntado <- sequential 
ro_cfg.cut < ■ I.C 

ro_cfg. limit <- I.T 



A.6 Transformations and Convolutions 

Notes 

1 . Opcode = 0x4 (convolutions) and 0x5 (transformations) 

2. the coprocessor performs an operation which is a superset of what is requi red for each of 
Image Transformations and Convolutions. As far as the coprocessor is concerned the 
only difference between a transformation and a convolution is tiiat for a transformation, 
the kernel step size (horizontally and vertically) is the Size of ihe kernel, whereas for a 
convolution, the step size is one source pixel 

3. Options are: 

• i nterpolating or snapping-to-closest pixel s 

• pixel accumulation (i-e. a kernel) or not 

• prc-mukiplicaiion or not of the source piad s 

• clamp, wrap or absolute value for decerrolrtng final result 

4. Note that transformations and convolutions don'l work in place, i.e. if the source and 
destination pointers are the same it will break. 

Pixel Organiser: 
-- operand A is the kernel descriptor and the PO delivers 
-- Kernel co-erf icients to the 1QP. This is coded as 
*- "other" 

- - L *= 1 long format 

0 short format 
if LR « 0 then 

1. pojen <- 0x0000: Lien 
endir 

if A_descriptorS = 0 then 

2. pojdrnr < - set 
endif 

3. pojsald <- A 

4. pojcfg.mode <- con voLut ion/transformation 
pOL.cf9.dst <- mdp 

— DCC setup; 

the implied data type here is the source image 
if-B_descrtptor.il « other then 

5. dcc_cfg2.cache_rrissjnst <- B_d*3criptor.C 
if LS » 0 then 

dec_oig2jnode <- 6 4 bit mode 

else 

dcc_cfg2jnode <• random mode 
endif 
else 

dcc_cfg2.eache_mlss_lnsl <- D descriplor.C 
endif 



k 
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Operand organiser 3: 

-- Operand D is a pointer to source image (other), ^nythin 
olse is probably wrong, but dc it anyway 
if B_descr1ptor.lf !- other then 

6. oobjen <- pojen 

if Q_dftscriptor.S = 0 then 

7. oob_dmr <- s^L 
endif 

8. oob_sqid <- B 

9. oob_cfg.operate < - operate 



-- Operand Organiser C: 

-- operand C descriptor is borrowed by the r.c field 
so OOC is not set up 

Main Data Path: 
-- Kernel descriptor is delivered to MDP via PO 

10. mdpjen <- pojen 

11. mdp_bm_rows <- Cdescriptor.r 
mdp brn.cols <- Cjte script or. c 

12; mdp_cfgJnstruction <- I. opcode 

mdp_cfg.long_kml <- A_descriptor.L 
mdp_cfg. decode <- 1 
i»dp_cfg.opwste <- I 

Result Organiser: 

result will be pixels or part thereof 
if n_descriptor.S - 0 then 

13. it>_dmr c- set 
endif 

14. »o_sa <- R 

15. ro_cfg.mode <- sequential 



A.7 Matrix Multiplication 

Notes 

1. Opcode c 0*3 

2. Options arc: 

* pre^ multiplication or not of the source pixels 

* clamp, wrap or absolute value for determining final result 

* Operand C may be a register write 

•-It there id a register set targeted at anything other than 

the MDP, PO, RO or DCC 
if *(D.R - X) and ( C_descrlptor.topnlbMe i» 0x5,0x6, OxA or OxB) then 

0. £ (^descriptor <- C 
endiC 

- - Pixel Organiser? 
- Operand A is the source pixelo . only makers sense to have 
whole pixels, anything else is probably wrong 
if 1.R = 0 then 

1. pojen <- 0x000O:IJen 
endif 

if Ajdescriptor.S - 0 then 
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2- podrnr <- set 

endif 

3. po_sald <- A 

it there is a register set Lar^Led aL PO 
if (D.R - 1) ana { C_descriptor.lopnibbl& = 0x6) then 

4. @C descriptor <- C 
endif 

5. po cfg.modB <- sequential 
p0_nfp.dst <- map 

-- DCC setup: 

the implied other dat-.A type is a fuatrix of coefficients 
if Bjrtescriplor.if = other then 

6. dec addr <- B 
endif 

if there is a register write targeted at DCC 
if (tXfl - 1) and (Cjdescrfptor.topnlbble = 0x5) then 

7. €C_descrlptor <- C 
endif 

if B_de$crJptorJf « other 

8. dcc_cfg2.cache_irrfss_tn&1 <- B_desorIptor.C 
dcc_cfg2.mode <- matrix multiply 

else 

dco_cf g 2.cach e_mi cs J n si <- B^descriptor.C 
endif 

Operand Organiser B : 
-- Operand B is the matrix co-efficients (other), 
-- anything else is probably wrong, but do it anyway 
it B_descr1ptorJt •= other then 

9. oobjen <- pojen 

if B_descriptor.S = 0 then 

10. oob__dmr <- set 
endif 

11. oob_said <- B 

12. oob_cfg.opcrate <- operate 
endif 

Operand Organiser C: 

Operand c may be a register to set. If it isn't it should 
--be other * anything else is probabLy wrong but do it anyway 
if (R.D « 0) and (C_desci1ptor.il != other) then 

13. oocjen <- pojen 

if C_descr[ptor.S » 0 then 

14. ooojdmr <- oet 
endif 

15. ooc_sald <- C 

16. ooc_cfg.operate <- operate 
endif 

-- MDP setup: 

if there is a register set targeted at MDP 
if (D.R " 1) and (C_descriptor.topnlbbJe = OxA) then 

17. $C_descriptor <- C 
endif 

18. mdp_dg.inst ruction < - (.opcode 
mdp_cfg.detode <- 1 
mdp^cfg.op^ratB • < - l 
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Result Oit^n iser : 
if R_descriptor.S = 0 then 

19. ro_dmr <- set 
end if 

20. tojsa <- R . 

-* iX there is a register vrite targeted at RO 
if (D.R = a j and {C_doscriptof/,opnibbIe = 0x0) then 

21. SC_descriptor <- C 
endif 

22. rocfg.mode sequen*:.ial 



A.8 Halftoning 

Notes 

1- Opcode » 0x7 

2. Only option is the number of levels of halftoning 

3. can be done on pixels or bytes as long as the halftone screen is appropriately unraeshed 
or meshed. 

-- Pixel Organiser; 

"A operand is the source pixels or bytes 

-- PO operates in conventional sequential mode 

if I.R « 0 then 

1. pojen 0x0000: Men 
endif 

if A_descrlptor.S = 0 then 

2. po_dmr <- set 
endif 

3. po said <- A 

4. po_efg.mode <- sequential 
po cfg.dst <- mdp 

DCC setup: 
--no implied other data type 

--if there is a register vrite targeted at DCC 
if <D.R « l) and (C„descrlptor.topnlDWe = 0x5) then 

5. ^descriptor <- c 
endif 

if B_descriptor.C » O then 

6. dc<3_cf g2_cache ml** JnsX <- B descriptor.C 
endif 

-- Operand Organiser 3: 

-- Operand a ic the halftone screen, and should be a tile 
--'anything else is probably wrong. There is no sensible 
-* implied "other- data type for halftoning, 
if B_descriptor.lf other then 

7. oobjen <- pojen 

i f B dfttcrlptcr.S = 0 then 
6. oobjdmr <- set 

endif 

9. oob_sa?d <- B 

if B_de$crfptor.what = tile 

10. oobjile <- C 

endif ( 
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11. oob_cfg,operate <- operate 
end if 

Operand Organiser C: 
-- Operand C vord is borrcwei by -ne tiled Operand £3. 
-- The C_descriptcr is completely ignored 
-- OOC is in random .-node (default) and wont be touched 

Main Data Path: 

12. mdp_brn .level <- Kiev els 

13. mdp_cfg .instruction <z- I . opcode 
mdp_cfg^Jecode <- 1 
tndp_cfg .operate <- 1 

Result Organiser: 
if R_dBScriptor.S - 0 tr.en 

14. ro_dmr <- set 
end if 

15. ro_sa <- R 

16. ro cfg.mode <- sequential 



Memory Copy 

Notes 

1. Opcode « 0x9 

2. Tliis insuuai<n uses two quite distinct mechanisms to achieve a memory copy operation: 

♦ A General Data Transfer utilises the normal dau flow path Uuuugh the coprocessor and 
so can utilise the various functions associated with the data manipulation units in the PO 
and RO. 

• A Peripheral DMA utilises a direct connection between the PIC and rhe J .MC This means 

that no data manipulation can be performed, and thai this operation may be oveilappsd 
with subsequent instructions. 

9.1 General Data Transfer 

-- Pixel Organiser; 
-- A operand is the souxce d=ta 
— PO operates in conventional sequential mode 
if I.R - 0 then 
1- pojen <- 0x0000: Men 

end if 

if A^desoriptor.S ~ 0 then 

2. po_dmr <- B 
end if 

3. poisaW <- A 

4. po_cfg.made <- sequential 
po_efg.dst <- jc 

Operand organiser B: 

operand D word represents a value to put in po_dmr 
-- Operand Organiser C: 

~- Operand C word represents the value to put in the ro_dmr 
JPEG Coder: t 
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5. jc_cfg.lnslruct ; jn <- 1. opcode 
jc__cfg.lbo <- O.bo 
jc_cfg.obo <- l.obn 
jc_cfg. decode <~ 1 
jc cfg.cperate <- l 



ResuLr Organ is*-:: r : 
Lf R_de scrip; jr.S = J than 

6. ro_dmr <- C 
end 

7. ro_sg < n 

S. ro_r.fg.mode < ssqu^ ^iaj 



A.9.2 Peripheral DMA 

Motes 

1. May be overlapped or nOL This is dealt with by the 1C 

2. Operand C may be a register to set 

3. This instruction is different from other Activity" instructions in that the PIC is the mod- 
ule thai drives the data. 

only setup required is for the ?IC: 
if 1.S = l then -- £ata in from pic 

1. pic__abus_addr <- R 
else 

pic _abus_addr <- A 
endif 

-- if there is a register write targeted anywhere 
if D.R « l then 

2. ec_descrlptor <- C 
endif 

if 1.R = 0 then 
3- plc_abus_oig.ab_coun1 <- OxOO:Llength 

endif 

plc_abus_cfg.ab_byt&_en <- Kbyte 
plc_abus__cfg.ab_type <- iJR 
pic_abus_cfg.start <- 1 
Pixel Organiser; 

Operand A is the source address given to the PIC 
" - The PO is not setup 



" Operand organiser B: 
~- ODD is not setup 

■- Operand Organiser C: 
ooc is not setup 

Result Organiser: 

The Result word represents the target address 
- - RO is not set up 



( 493 ) 1 1 - 



A. 10 PhotoCD Decompression 

This family of instructions consists of three different operations: liori7.GntaI interpolation, 
vertical interpolation ainJ residual merging. As it aappens, the setup for Vertical interpola- 
tion and the setup for residua] merging arc identical. 

Opcode, for all rhese instructions is 0x9 

A.10.1 Horizontal Interpolation 



Noles 

1 . can operate on pixel s or by tes 

.2. this is a one operand instrucii on so Operand C may be a register to set 

--if there is a register set targeted at anything other than 
-- the MDP, PO or RO 

if (0.R - 1) and (C_descriptor.1opnIt)ble 1= 0X5, OXA or OxB) then 

0. @C_descrtplor <- C 
endif 

— - Pixel Organiser : * 
if I.R « 0 then 

1. pojen <- 0x0000 :Uen 
endif 

if A^descriptor.S = fl then 

2. po_dmr <- set 
endi f 

3. po said <- A 

--if there is a register set targeted at. po 
if (O.R - 1) and ( C_descriptor.topnibble = 0x6) then 

4. €C_descriplor <- C 
endi f 

5. pojcfg.mode <- sequential 
po_cfg,dst <- indp 

— operand Organiser B : 

-- Operand B word is completely ignored 

— The B_descxi?tor is completely ignored 

Operand organiser C: 

Operand C may be a register to set up 

- Main Data Path: 
--if there is a register Get targeted at MDP 
if (0-R - 1) and < C_de3criptorJopnIbbte - OxA) then 

6. eC_descriptor <- C 
endif 

7. mdpjcfgJnstniction <- i . opcode 

if A descriptor.*? packed bytes then-- tell ndp about size 

mdp_cfg.wordJnput <- l 
else 

mdp_cfg.wordJnput <- 0 
endif 

mdp_cfg.decode <- 1 
mdp_cfg.operate <- 1 



- Result Organiser: 
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if Rdescrlptcr.S = 0 then 
8. ro_drnr <- set 

endif 
9- ro_sa <• R 
10. ro__cfg.m»ds <- sequential 

.10.2 Vertical Interpolation and Re" : :'uai f rgfng 

Notes 

1 . The sci up f o: VcoJ ca) I n icrpol a Li or) a nd Res id'.: if Merg i n a is i : J - :w i ca ! . 

2. Can operate on either bytes or pixels 

3. This is a two operand instruction so Operand C may be a register set 

— if there is a register set targeted at anything other than 
the MDP, PO, OOB or RO 

if (D.R = 1) and (C_deseriptor.tDpnibbl» 1 = 0x6,0x7, 0:*A or 0>cB) th«n 

0. ecdescrlptor <- C 
endif 

Pixel organiser: 
if LR = 0 then 

1. pojen <- 0x0000: Men 
endif 

if A_de»criptor.S - 0 then 

2. podmr <- set 
endif 

3. po_said <- A 

if there is a register set targ-red at po ' 
if (D.R - 1) and (C_descriptor.topnibble = 0x6) then 

4. C^descrlptor <- C 
endif 

5. po_cfg.mode <- sequential 
po. cfg.dsl <- rndp 

- - DCC setup; 

--no implied other data type 

--if there is a register write targeted at Dec 
if ( D.R = X\ and (Cjdeecriptor.iopnlbble - 0x5) then 

6. @C_d«scriptor <- C 
endif 

if B_descriptoT.C - 0 then 

dcc_c1g2.cache_mlss_lnst <- B descriptors 
endif 

Operand Organiser B: 
-- operand B is the residuals or one operand tor 

interpolation, anything else is probably wrong, but 

- - do it anyvoy 

- There i© no sensible implied * other" data type for 

- this. 

if B_descrlptor.lt i= other then 

7. oobjen <- po Jen 

i f B_descriptor.S - D then 

8. oob_dmr < - set 
endif 

9. oob_said <- B 
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\n 

oncif 

*- if there is <l register sec tarc-^ed i±t OOB 
if (O.R = 1) and (C__descriptor.topnibble = 0x6) then 

10- 0C_descriptDr <- C 
endif 

11- oob_c(g.operate <- operate 
endi £' 

~- Operand Organiser C: 
-- may be a register to set 

-- Main Data Path: 

--if there is a register set targeted at HDP 
if <D.R = 1) and <C_descriptor.topnibble *= OxA) then 

dC_descripior <- C 
endif 

mdpjcfgJnstructlon <- L opcode 

if A_descrfptorJf /= packed_bytes then - tell mdp about size 
mdp_cfg.word_Jnput <- 1 

else 

mdp_cfg.word_input <- 0 
endif 

mdp_clg.de code <- 1 
mdp_cfg.operate <- l 

-- Result organiser: 
if R_descrfptor.S = 0 then 

14. ro_dmr <- set 
endif 

15. ro_sa <- R 

16. ro_cfg.mode <- 0 -- sequential 

A. 1 1 Control Instructions 

Notes 

1. Control Instructions consist of two classes of operations: Flow Control Instructions ana 
Internal Access Instructions. 

A.11.1 Flow Control 



12. 
13. 



Notes 

1. Opcode <=0xB 

2. Flow Control Instruction* currently consist of lumps of various kinds and Waits of vari- 
ous kinds. 

3. No explicit setup is done In the coprocessor, and this instruction is not an "activity" 
instruction, ie no the coprocessor submodules other than the instruction controller actu- 
ally do anything. 

4. Operand C may oca register to set 

only thing to do is; 
if D.R = l then 
0. ec_descriptor <- C 

endif 

Pixel Organiser: 4 
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"-no setup associated wiU\ Operand A 

Operand Organiser B : 
-- no setup associated with Operand B 

Operand. ciyaniser C: 
*- no setup associated vi;:.!> Ope ran C 

-- Result Organiser: 

--no setup associated vitn the Result 

A.1 1 .2 internal Access: Read 



Notes 

1. Opcode = 0*A 

2. A "read" is a transfer of data our of the coprocessor 

3. THe only module in che coprocessor that actually dees anything for this is the RO 
-- Pixel Organiser: 

- - no setup associated vith Operand A 

Operand Organiser Br 

- - no setup associated with Operand R 

Operand Organiser C: 
-- no setup associated with Opkrand c 

-- Result Organiser t 

1. ro_subst <- Cbus(A) 

2. ro_sa <*. B 

3. ro_ctg.mode <- Cbus_data 



A.1 1.3 Interna! Access: Write 



Notes 

1. Opcode = OxA 

2. A "write*' is a transfer of data i nto the coprocessor. 

3. This instruction is not an -activity" instruction as no modules other than the 1C actually 
do anything. 

1. Cbus<A) <- B 

-- Pixel Organiser: 

--no setup associated with Operand A 

operand organiser R : 

ho setup associated vith Operand B 

■ - Operand Organiser C: 

no setup associated vith Operand C 



Result Organiser: 

no setup associated vith the Result 
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12 Reserved Instructions 

Notes 

1 . Opcodes 0x0. OxE and OxF are "j^secvecT. 

2. A "reserved" instruction will generate a (maskable) error. 

3. These "reserved" opcodes arc iraended to be used for other instructions that mav be 
added in taier revisions of the coprocessor. 
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Appendx B: Registers 



1 .1 Registers and Tables 

This Section describes the coprocessor's registers. Tr-se registers can be modified in one / 
three ways: 

1. Specific the coprocessor instructions exist for reading and writing registers. By usin^ 
these instructions these registers can be read or *riritn usin» initiator PCIbus cvcles, or 
by transactions on the Generic Inien^. or to/fran . -y memory ife may be ■S.r^i^i 
to the Local Memory Interface, 

2. Many registers change value as a side effect of instruction execution. The primary mech- 
anism by which the coprocessor conjures itself for instruaioii execution is by setting 
various registers to reflect the currently executing instruction. At the end of the instruc- 
tion, the registers reflect the stale of the coprocessor at thai time. 

Most typical operations can he completely specified and set up by the one instruction- In 
some eases, it Is necessary to set some registers immediately prior to executing the instruc- 
tion. 



Semantics of "reserved" register bits 

Any registers or pans thereof Utat are "reserved'* have the following semantics: 

• a write io a reserved locaiion will succeed and the data will be discarded 

• a read from a reserved location will succeed and will result in undefined value 
All unspecified registers and register fields are reserved. 

1-1.1 ; Register Taxonomy 

The registers in the coprocessor are claisi fled into types based on their behavior as described 
in this section. In these descriptions: 

- externally - external to the module, U. via the Ctes interface. I.e. via target mode PCI, by 
the instruction controller or via the external Cbus interface. Note that the registers can't 
be set via bit-set (Type Q mode from the PCIbus. 

• internally = internally to the module 



Status Register 

A status register is read-only externally and rcanVwriic internally. 
Contlgl Register 

A Conflgl register Is mart' write externally and read only internally. 

A Configl register does not support Type C Cbus operations (ic does not support oil set 
mode) and is used for registers that hold byte (or bigger) type configuration information 
such as address values. 



Canfig2 Register 

A Conftgl register is also read/write externally and read only internally. 

A Config2 register supports Type C Cbus operations fie supports bit set mode) and is used 
for registers that hold configuration Information thai may need to be set on a bit by bit basis. 



ConlroU Register 

A Controll register is readable/writable both internally and externally. 
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A Control! register does wot suppon Type C Cbus operations (ic docs not support bit set 
mode) and is used for registers that hold byte (or bisecr) type control information such as 
address values. 

C-ntro!2 Register 

A CorcrOl2 register is readable/writable both internally and externally. 

A Control2 register supports Type C Cbus operations (ie supports bit set mode) and Ls used 
for registers thai hold control information that may needio be set on a bit by bit ba si s. 

Interrupt Register 

The bits within an Interrupt register are sellable 10 I internally and resectable to 0 externally 
by writing a 1 to that bit. 

Each of the module interrupt/error registers are of this type. Each module's intemipt/error 
register consul su of three fields: 

[7:0] represents any error conditions generated by the module 
[23:83 represent any exception conditions generated by the module 
[31:24} represents any interrupt conditions generated by the module. 

1.1.2 Register Map 

Table 1,1 shows the coprocessor's registers. The "number* in Table 1.1 is the number of the 
register rather than a byte address. 



Table 1.1 the coprocessor registers 





lilyllll 




External Interface Controller Registers page 26 


0x00 


eic_cfg 


Config2 


Configuration 




0x01 


eic_stat 


Status 


Status 




0x02 


eic_err_int 


Interrupt 


Error and Interrupt Status 




0x03 


eicjenrjnt_en 


Config2 


Error and Interrupt Enable 




0x04 


eic^test 


Config2 


Test modes 




0x05 


eie_gen_pob 


Coniig2 


Generic bus programmable output bits. 




0x06 


dc_high_addr 


Configl* 


Dual address cycle offset 




0x07 










0x08 


cic_wilb_v 


Control 2 


Virtual address and operation bits for TLB 
InvaJi date/Write 




0x09 


eic_wtlb_p 


Config2 
• 


Physical address and control bits for TLB 
Write 




OxOA 


eic_mmu_v 


Status 


Most recent MMTJ virtual address translated* 
and current LRU location. 




OxOB 


eic_mmu_p 


Status 


Most recent page table physical address 
fetched by MMU. 




OxOC 


eicjp_addr 


Status 


Physical address for most recent IB us access 
to the PCI Bus. 




OxOD 


eic_rp_addr 


Status 


Physical address for most recent RBus access 
to the PCI Bus. 
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1.1 the coprocessor registers 













OxOE 


eic„ig_addr 


Status 


Address for must recent I Bus access to the 
Generic Bus. 




OxOF 


eic_rg_daia 


Status 


Address for most uceni RBus access to the 
Generic Bus. 






1-ocal Memory Controller Registers 


page 32 


Ox 10 


lwii_cfg 


Comrol2 


General con figuration register 




Ox 1 1 


IroLsts 


Status 


Genera) status register 




0x12 


lmi_err_im 


Interrupt 


Error and interrupt status register 




0x13 


Jnrijerrjncen 


Control2 


Error and interrupt enable register 




0x14 


Imijlcfg 


ControI2 


PRAM configuration register 




0x15 


lrni_mode 


Control2 


SDRAM mode register 




0x16 










0x17 










0x18 










0x19 










Oxl A 










Oxl B 










Oxl C 










OxlD 










Ox IE 










OxlF 










Peripheral Interface Controller Registers 


pa^e 36 


0x20 


pic_cfg 


Config2 


Configuration 




0x21 


pic_stat 


Status 


Status 




0x22 


plc^err^ini 


Interrupt 


Interrupt/Error Status 




0x23 


pic_crrjnueti 


Config2 


Iotemipi/Erfor Enable 




0x24 


pic_pbws w cfg 


Comrol2 


Configuration and control for ABus 




0x25 


pic_abus_addr 


Cnnfigl 


the copiocessort address for ABus transfer 




0X26 


picjoert^cfg 


Control 2 


Configuration and control for Centronics 
mode. 




0x27 


plq.cert^dir 


Conftg2 


Centronics pin direct control register 




0x28 


picLjevctseLcTe 


Control 2 


Qanfiguration and control for reverse (Input) 
data transfer* 




0x29 










0x2A 


pic„timciO 


Configl 


Initial data timer value. 




0x2B 


pic_iimcr| 


Conftgl 


Subsequent data timer value 




0X2C 










0x2D 










Ox2E 










0x2F 










Miscellaneous Module Registers 




0x30 1 


tnm _cfg 


Config2 


Configuration Register | 
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Table 1. \ the coprocessor registers 







■x-Xi.v..ik}t2. : £\-> % 






0x31 


mm stat 


5 tafus 






Dk32 


nun e rr j nt 


11 HU lUyi 


Error and Interrupt Rctistei 





0x33 


mm r t r' int n 
1I **»*„CIl__UH en 




Error and Interrupt Ma>!cs 


— . 


0x34 


< ■! ■ tJ V 


COtuigz 


Global Configuration Register 




0x35 




Uoruig 


Diagnostic Configuration Register 




0x36 


mni_grst 


Config* 


Global Reset Register 




0X37 


mm_geir 


Cortfig2 


Global Error Register 




0x38 


mm^geip 


Config2 


Global Exception Resis:er 




0x39 


inm_gint 


Conftg2 


Global Interrupt Register 




0x3A 


mm_activc 


status 


Global Active signals 




0x3B 










0x3C 










0x3D 










0x3E 










0x3F 










Instruction Controller Registers 




0x40 


ic_cfg 


Config? 


Configuration Register 






ic_stat 


Status/ 
. Interrupt 


Status Register 




0x42 


ic^enrjnt 


Interrupt 


Error and Interrupt Reciter (write to clear 
error and interrupt) 




0x43 


ic_crr_ini_en 


Config2 


Error and Interrupt Erabie Register 




0x44 


ic_ipB 


ComoU 


A stream Instruction Pointer 




0x45 


ic_tda 


Configl 


A stream Todo Register 




0x46 


ic_fha 


Control] 


A stream Finished Register 




0x47 


ic_tnta 


Config 1 


A scream Interrupt Register 





0x48 


Icjoa 


Status 


A stream Last Overlapped Instruction 
Sequence number 





0x49 


Icjpb 


Control I 


B stream Instruction Pointer 




0x4A 


ic_tdb 


Config 1 


B stream Todo Register 




0x4B 


ic_fnb 


Control 1 


B stream Finished Register 




0x4C 


■icjntb 


Cctnfigl 


B stream Interrupt Register 




Ox4D 


icjob 


Status 


B stream Last Overlapped Instruction 
Sequence number 




0x4E 


ic_sema 


Status 


A stream Semaphore 




0x4F 


ic_semb 


Status 


B stream Semaphore 






Data Cache Controller Registers 


page 42 


0x50 


dcc_cfgl 


cortfi*2 


DCC configuration 1 register 




0x51 


dccLStat 


status 


state machine status bits 




0x52 


dce_ar_iitt 


status 


DCC error status register 




0x53 


dcc_eir_int_en 


control I 


DCC error interrupt enable bits 




0x54 


dcc_cfg2 


comrol2 


DCC configuration 2 register 
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Table 1-1 the coprocessor registers 













Ox 5 J 


dcc_addr 


configl 


Base address register for special address 
modes. 




0x56 


occ 1 vu 


control I 


"valid" bit status for lines 0 to 3 1 




\JaD f 


OCCjvl 


conuoll 


"valid" bit status for Hr.es 32 to 63 




nvco 


dccjv2 


controll 


'Valid" bit status for lines 64 to 95 






dccjv3 


controll 


"valid" hu status for lines 96 to 127 




Ox5A 


dcc_raddrb 


status 


Operand Organiser B request address 




UaJD 


dcc_raddrc 


status 


Operand Organiser C request address 




Ox5C 


dcc_tesi 


conuoll 


DCC trst register 




UX3U 










A r C|J 

UXJC 










OxSF 












Pixel Organiser Registers 
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0x60 


PO-Cfg 


Config2 


Configuration Register 




0x61 


pGLSttl 


Status 


Status Register 




0x62 


poucirjrii 


interrupt 


Errotflntcrrupt Status Register 




0x63 


po.errjntjen 


COTiflgl 


Error/Interrnpt Enable Register 




0x64 


po_dmr 


Configl 


Data Manipulation Register 




0x65 


po_cubst 


Confi»2 


Substitution Value Rc fitter 




0x66 


po_cdp 


Status 


Current Data Pointer 




0x67 


pojen 


Controll 


Length Register 




0x6$ 


po_$aid 


Control t 


Oie copiocesson Address or Immediate Data 




0x69 


pojdr 


Control 2 


linage Dimensions Register 




0x6A 


po_muv^valid 


Control 


MUVvalidbits 




0x6B 


po.nxuv 


Configl 


Base address of MUV RAM 




0x6C 










OXoJLJ 










UXOli 










OX or 












Operand Organiser B Registers 
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Config2 


Configuration Register 




0x71 . 


eefe>_0tt 


Status 


Status Register 




0x77 


oob_errjnt 


Interrupt 


EnxK/lntcrrupt Register 




0x73 


oob^errjnt.en 


Conflg2 


Enoi/lntemipt Enable Register 




0x74 


00b_dmr 


Config2 


Data Minipolation Register 




0x75 




Cortftg^ 


Substitution Value Register 




0x76 


Oob_cdp 


Status 


Current Data Pointer 




0x77 


oobjen 


Control I 


Input Length Register 




Ox/8 


oob.said 


Controll 


Operand the coprocessor! Address 




0x79 


oab_tilc 


Control 1 


Tiling length/offset Register 




0x7A 
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Table 1 .1 the coprocessor registers 



J i Ul 1 1 U til J; 










/wrn 
UX Jd 










UX /L 










QX7D 










Ox7E 






... _ f 


0x7F 










Operand Organiser C Registers 
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0x80 


ooc_cfg 


Config2 


Configuration Register 




0x81 


ooc__stat 


Status 


Status Register 




0x82 


ooc_crr_int 


Interrupt 


Error/Interrupt Register 




0x83 


ooc_err_im_en 


Config2 


Error/Interrupt Enable Register 




0x84 


ooc__dmr 


Config2 


Data Manipulation Register 




Ox8S 


OOC_5UbSt 


Config2 


Substitution Value Register 




0x86 


ooc_cdp 


Status 


Current Data Pointer 




0x87 


oocjen 


Controtl 


Input Length Register 




0x88 


ooc_said 


Control I 


Operand the coprocessor! Address 





0x89 


ooc_tilc " 


Control I 


Tiling lenguVoffsct Register 




OxSA 










OxSB 










0x8C 










Ox8D 










Ox8E 










Ox8F 










JPEG Coder Registers 


*a 


0x90 


jc_cfg 


Config2 


configuration 




0x91 




Status 


status 




0*92 


Jcjcnjnt 


Interrupt 


error and Interrupt stams register 




0x93 


jc_crrjnt_cn 


Conflg2 


error and Interrupt enable register 




0x94 


jc_jsi 


Configl 


rethe coprocessoit interval 




0x95 


jc_decode 


CoturoI2 


decode of cuirent instrjetion 




Ox96 


jc_ies 


Control I 


residual value 




0x97 


jC_table_scl 


Control2 . 


table selection from decoded instruction 




0x98 . 










0x99 










0*9A 










0*9B 










0x9C 










Ox9D 










0x9E 










0x9F 














Main Data Path Registers 
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OxAO 


mdp.cfg | Config2 


configuration 
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Table 1.1 the coprocessor registers 









£ ^s^^p;jt^|f^.C^s c n p Up ng £ j: • ; - ■ «^ jsp^Hi 




OxAl 


mdp_5tat 


Status 


status 




0xA2 


mdp_crr_int 


Interrupt 


crrar/intcriupt 




0aA3 


• • iu y C 1 1 1 ill C-J l 


Config2 


enor/inierrupt enable 




0xA4 




Config2 


test modes 




OxA5 


inup__op 1 


Control^ 


current operation 1 




0xA6 


JTlup^OpZ 


Control! 


current operation 1 




0xA7 


nidp_por 


Control 1 


offsei for plus operator 




O* Aft 


mdp_bi 


Control I 


Wend the coprocessor! offset to index table 
enlry 




0xA9 


rndp_bm 


Control! 


Wend end or number of rows and columns in 
matrix, binary places, and number of levels in 
halftoning 




OxAA 


nidp.Ien 


Control 1 


Length of Nc^nd to produce 




OxAB 










OxAC 










OxAD 










OxA£ 










OxAF 












Result Organiser Registers 
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OxBO 


ro__cfg 


Config2 


Ccnfiguration Register 




OxBl 1 


rojstat 


Status 


Status Register 





OxBZ 


rcLprrJnt 


Interrupt 


Erratflmcrrupt Register 




OxB3 


ro.errjnucn 


Config2 


Enor/Intemrpt Enable Register 




0xD4 


rcLdmr 


Conf1g2 


Data Manipulation Redster 




OxBS 


ro_subsi 


Configl 


Substitution Value Register 




OxB6 


rcLcdp 


Status 


Current Data Pointer 




0xB7 


rojcn 


Status 


Output Length Register 




OxB8 


roLsa 


Configl 


the coprocessort Address 




0xB9 


rojdr 


Conflgl' 


Image Dimensions Register 




OxBA 










OxBB 


ro_vbasc 


Configi 


the coprocessor Virtual Base Address 




OxBC 


fOjCtit 


Configl 


Output Cut Register ' 




OxBD 


n>Jrnt 


Conflgl 


Output Length Limit 




OxBE 










OxBF 












; PCIhus Con figuration Space atlas 




OxCO- 
OxCD. 
OxCF 






A read onl y copy of PCI configuration space 
registers OxO to OxD end QxF. 


page 73 


OxCE 


pcLcxiemaLcfg 


Status 


32-bit field downloaded at reset from an 
external serial ROM. Has no influence on the 
coprocessor's operation. 





: ■* if* '» f 4 fc' « 
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Input Interface Switch Registers page 23 


OxDO 


iis_cfg 


Cpnfig2 


Configuration Register 




OxDl 


iis_stat 


Status 


Status Register 




OxD2 


iis_err_int 


Interrupt 


Interrupt/Error Status Register 




OxD3 


ii5_erx_int_cn 


Config2 


Interrupt/Error Enable Register 




OxD4 


iis.jc_addr 


Status 


Input address from IC 




OxD5 


iis_dce_addr 


Status 


Input address from DCC 




OxD6 


iis_po_addr 


Status 


Input address from PO 




OxD7 


iisjwist 


Status 


Burst Length from PO, DCC & IC 




OxDS 


iis_base_addr 


Configl 


Base address of the coprocessor memory 
object In host memory map 




OxD9 


iis_test 


Conftgl 


Test mode register 




OxDA 










OxDB 










OxDC 










OxDD 










GxDE 










OxDF 














OxEO- 
OxFF 






not used 





1-1.3 Register Definitions 



Miscellaneous Module Registers 
a. mm_cfg 

Table 1.2 mmjcfg Register Fields 





SP^rm^^i 




1 


E 


(Mtsabled 
l=enabled 


2 


B 


0=not bypassed 
I =by passed 
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b. mm_slat 
Table 1 .3 mm_stat Register Fields 













Reserved 



c. mm_errjnt 



Table 1 A m m_err Jnt Register Fields 



^^^^^^^^^^^^ 













d, mmjerrjnt_en 



Table 1.5 mm_eirjnt_en Register Fields 



e. mm_gcfg 



Table 1.6 rnm_jjcfg Register Fields 





lliiiilMfiiii 






0 


use_pci_clk 


(radonly) value of the use_pci_clk signal 
0=use clk_in to generate elk 
l=use pca_clk to generate elk 


1 


SlDW_Glk 


(read only) divide source clock by 2 to generate elk 

0=divide by 2 (default) 

lttdonotcfivtdeby2 


2 


cbus_coriig 


(readonly) 

0 * generic bus configured 
1 « external cbus configured 






Resetted 
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f. mmdiag 
Table 1.7 mm_diag Register Fields 







«*3™fc**»*«?$«*K^ Deserro 1 ion ^^iw** ^s^^- ■ 


t:0 


dtagjcfg[l:0) 


program which diagnostics appear on the diag_i pin: 
00- the coprocessor busy 
01=ncw A sire am instruction strobe 
I0=new B stream instruction strobe 
i 1 =new either stream instruction strobe 


3:2 


diag_dcfg[l:0] 


program which diagnostics appear on the diag_d pins: 

00=activity diagnostics: 

diag_d(0) = PCI bus master transaction 

diag_d [ 1 J - generic bus transaction 

diag_d[2] = local memory transaction 

diag_d [ 3 ) « peripheral interface transaction 

diag_d [4 ] -» non overlapped instruction active 

<Hag_d[5] « overlapped instruction active - 

01 ^instruction diagnostics: 

diag_d { 3 : 0 J - current instruction opcode 

diag_d f 4 ] = current instruction stream 

diag_d{5) = unused 

10=cachtng diagnostics: 

diag_d[Oj ~ data cache hit 

diag_d [l ] - servicing data cache miss 

diag_d{2] « TLB hit 

diag_d(3] - servicing TLB miss 

diag_dt4) ^MUVhlt 

d iag_d 1 5 ) « servicing MUV miss 

Install diagnostics: 

diag_d (0] = waiting on output from PO 
diag_d[l) = waiting on output from OOB 
diag_d [ 2 ) - waiting on output from OOC 
diag_d [ 3 J = stall due to ROfifo full 
diag_d [ 4 ] waiting on instruction fetch 
diag_d(5J - unused 
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g. mm_grst 
Table 1.8 mm_grst Register Fields 









15:0 


msist[ 15.-0] 


Per-module sofnvre resets. 






Write 1 to appropriate bit to cause software reset. These 






bits arc self clearing. 






bitO = External Ir.ierface Controller 






bit 1 = Local Memory Controller 






bit 2 -= Peripheral Interface Controller 






bit 3 = Miscellaneous Module 






bit 4 « Instruction Controller 






bit 5 « Data Cache Controller 






bit 6 -Pixel Organiser 






bit 7 = Operand Organiser B 






bit 8 - Operand Organiser C 






bit 9= JPEG Coder 






bit 10 -Main Dai; Path 






bit 11 = Result Organiser 






bit 12 « reserved 






bit 13 = Input Interact Switch 






bit 14 « reserved 






bit 15 « Software Reset ail modules 
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h. mm_gerr 
Table 1.9 mm_perr Register Fields 









31:16 


crr(15:0) 


Per- module error starus bits: 

Note thai these biis arc rcad only. To clear the error condition, 
the appropriate module error bit should be cleared. 

0 « no error condition from this module 

1 = error condition from this module 
bit 0 - External Interface Controller 
bit 1 - Local Memory Controller 

ou z = renpnerai interface Controller 

bit 3 = Miscellaneous Module 

bit 4 = Instruction Controller 

bit 5 - Data Cache Controller 

bit 6 = Pixel Organiser 

bi 1 7 « Opciaixi Organiser B 

bit 8 = Operand Organiser C 

bit 9 « JPEG Coder 

bit 1 0 « Main Data Path 

bit 11= Result Organiser 

bit 12 -reserved 

bit 13 = Input Interface Switch 

bit 14 » reserved 

bit 15 = error on any module 


15:10 


err_en[15:OJ 


Per-module error enables: 

0 - error conditions from this module are not translated into an 
interrupt of the host 

1 - error conditions from this module are translated into an 
interrupt of the host 

bit 0 - External Interface Controller 
bit 1 -Local Memory Controller 
bit 2 - Peripheral Interface Controller 
bit 3 = Miscellaneous Module 
bit 4 = Instruction Controller 
bit 5 ~ Data Cache Controller 

* 

bit 6 ** Pixel Organiser 

bit 7 « Operand Organiser fi 

bit 8 ~ Operand Organiser C 

bit 9 - JPEG Coder 

bU 10 = Main Data Path 

bit 1 1 o Result Organiser 

bit 12 = reserved 

bit 13 « Input Interface Switch 

bit 14 o reserved 

bit IS - interrupt on any module 
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mnrngnxp 



1.10 mm_gexp Register Fields 




31:16 expfl5:01 



Per-rnodule exception- smnis hue 



Note thai these bits are read only. To clczr the exception condi - 
tion, the appropriate module exception bit should be cleared. 

0 = no exception condition from this module 

1 = exception condlcion from this module 
bit 0 - Externa) Imerrace Controller 

bit I = Local Memory Controller 

bit 2 o Peripheral Interface Controller 

bu3 - Misccttaneous Module 

bit 4 « Instruction Controller 

bit 5 - Data Cache Controller 

bit 6 = Pixel Organiser 

bit 7 - Operand Organiser B 

bit 8 » Operand Organiser C 

bit9-JPEGCoder 

bit 10 = MamDataP3th 

bit U - Result Orgsruser 

bit 12 = reserved 

bit 13 - Input Interface Switch , 

bit 14 = reserved 

bit 15 = exception en any module 



0 - exception conditions from this module are not translated 
into an i nterrupt of 'jk host 

1 = exception conditions from this module are translated into 
an interrupt of thtthD$t 

bit 0 - External Interface Control] cr 

bit 1 - Local Memory Controller 

bit 2 - Peripheral Interface Controller 

bit 3 o Miscellaneeis Module 

bit 4 - Instruction Controller 

bit 5 = Data Cache Controller 

bit 6 » 1^ Organiser^ 

ttt 7 - Operand Organiser B 

bit 8 -Operand Organiser C 

Wt9 = JPEOC0der 

bit 10o Main Data Path 

bit 1 1 * Result Organiser 

bit 12 -reserved 

Ml 13 - Input Interface Switch 

bit 14 -reserved 

bit 15 ■exception on any module 



15:0 



exp_enU5;0] 



Pcr-module exception enables: 
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I IB 

I mm^gint 
Table 1 .1 1 mm_gint Register Fields 









31:16 


iru(i5:0] 


Per- module interrupt status bits: 

Note that these bits are read only. To clear the interrupt condi- 
tion, the appropriate module interrupt bit should be cleared. 

0 - no interrupt condition from this module 

1 = interrupt condition from this module 
bit 0 = External Interface Controller 

* bit 1 - Local Memory Controller 

bit 2 » Peripheral Interface Controller 

bit 3 = Miscellaneous Module 

bit 4 » Instruction Controller 

bit 5 - Data Cache Controller 

bit 6 » Pixel Organiser 

bit 7 = Operand Organiser B 

bit 8 - Operand Organiser C 

bit9=JPEGCoder 

bit 10 = Main Data Path 

bit 11= Result Organiser 

bit 12 « reserved 

bit 13 = Input Interface Switch 

bit 14 = reserved 

bit 15 » interrupt on any module 


15:0 


int_en[15:0] 


Per-module interrupt enables: 

0 = interrupt conditions from this module are not translated 
into an interrupt of the host 

1 = interrupt conditions from this module are translated into an 
imerruptof thehost 

bit 0 mx External Interface Controller 
bit 1 » Local Memory Controller 
bit 2 - Peripheral Inteiface Controller 

Mf b K/ftCf*^ll avw*/Mie \Ar*At%\m 

bit 4 e Instmction Controller 

bit 5 - Dau Cache Controller 

bit 6 = Pixel Organiser 

bit 7 » Operand Organiser B 

bit 8 = Operand Organiser C 

bit 9 ° JPEG Coder 

bit 10 = Main Data Path 

bit 1 1 = Result Organiser 

bit 12 -reserved 

bit 13= Input Interface Switch 

bit 14= reserved 

bit 1 5 - interrupt on any module 
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k. mm_active 
Table 1.12 mmactive Register Fields 









15:0 


maccive(15:0] 


Per-module active signals (Read Only): 






These bits unlatched 






bit 0 = External Interface Controller 






bit I - Local Memory Controller 






bit 2 = Peripheral interface Controller 






bit 3 = Miscellaneous Module 






bit 4 - Instruction Controller 






bit 5 = Data Cache Controller 






bit 6 » Pixel Organiser 






bit 7 - Operand Organiser B 






bit 8 - Operand Organs ser C 






bit 9 « JPEG Coder 






bit 10 » Main Data Path 






bit 1 1 = Result Organiser 






bit 12- reserved 






bit 13 - Input Interface Switch 






bit 14= reserved 






bit 15 = any module active 


31:16 


f 


Reserved 



Instruction Controller Registers 



I. ic_cfg 

The lc_cf g register is divided into three pans. The least significant byte contains global con- 
figuration infoxmati on, The third least significant byte conrairts A stream configuration infor- 
mation and the most significant byte contains the B stream configuration information. 

This register has a reset value of OxOOOOOOOO. 
Tabic 1 11 a 1c_cbj Register Relds 









1 


E 


Enabled 
O-disabied 


2 


B 


0*noi bypassed 
1-bypassed 


3 


S 


r>=not in self test mode 
l«scff test mode 


[5:4] 


pn*l1:0] 


instrualon stream priority: 
0=»A* high, B low 
1«B high, A low 
2=3=round roWn 
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zoo 



Table 1.13 ic_c<g Register Fields 









6 


asr 


Asynchronous s:3Ji reject: 
0~not enabled 
1 -enabled 


7 


pd 


Partial Decode enable: 
o=not enabled 
1 enabled 


8 


cbus_eic dis 


Disable nvmp^ic frnm rhp Vic* F/\r /""Kmc mwIop 
u,j<j vi^ iwjuvdo iiuiu uic niVw. ioi * i jus cycles 

(Mo not disable 

I^disabte 


9 


cbus_rnrn_dis 


Disable requests from the MM for Cbus cycles 
0=do not disable 
!=di sable 


10 


cbusjnt dis 


i^imvm^ (mjucmj uimi viisz iv_- ior v^DUS CYCICS 

0-do not disable 
[^disable 


1S 


a_en 


0«A stream disabled 
I -A stream enabled 


17 


a_seq 


0=A stream sequence numbers disabled 
1~A stream sequence numbers enabled 


18 * 


a_prefetch 


0«A stieam prefetching disabled 
1=A stieam prefetching enabled 


19 


a_overlap_jen 


0=A stream overlapping disabled 
1«A stream overlapping enabled 


20 




0 « A stream do not pause on "sequence number com- 
pleted" interrupt 

1 «=A stieam pause on "sequence number completed** 
interrupt 


21 


a_to_pause 


0 « A stream do not pause on u instrucrion completed" 
mterrupt 

1 ■» A stream pause on ^instruction completed" inter- 
rupt 


22 


a_auto_prime 


0»A stieam "sequence number completed" interrupt 
not automatically primed 

1«A stream "sequence number completed" interrupt 
automatically primed 


24 




0=B stream disabled 
1=B stream enabled 


25 


b_seq 


0-B stream sequence numbers disabled 
l=B stream sequence numbers enabled 


26 


b_prefetch 


0=B stream prefetching disabled 
1«B stream prefetching enabled 


27 


b_overlap_en 


0°B stream overlapping disabled 

1-B stream overlapping enabled j j 
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.13 ic_cfg Register Fields 









28 


b_snc_pause 


0 «= B siream dt> not pause on "sequence number com- 
pleted" interrupt 

1 - E scream pause on "sequence mimter completed" 
iruerrupt 


29 


bjc_pause 


0 = B stream do not pause on "instruction completed* 
interrupt 

1 - B stnram p3usc on "insu.^Jon completed" inter- 
rupt 


30 


b_auto .prime 


0-B stream "sequence number completed" interrupt 
not automatically primed 

1=B stream "sequence number completed" interrupt 
automatically primed 


m. ic_stat 

This register is split into four sections. The least significant byte contains the internal state of 
the IC The second least significant byte contains the decoded result of the current instruc- 
tion, and current and prefetched instruction stream. The second most significant byte con- 
tains all status information about the A stream, and the most significant byte contains 
information about the B stream. 

The reset value of this register in QxOOOOOOOO. 
Ic_stat Register Fields 










ec_state[3:0) 


instruction execution state condition. 
Q-idle 

1 o fetching instruction 

2= wailing for overlapped instrucdon to finish 
3» decoding 

4= prefetching instruction 

5= wait for instruction to finish 

6- synchronise with externa] accesses prior to updat- 
ing registers 

7- update state registers 

8- bad state 


8 


overlap 


0 « overlapped instruction not in progress 

1 « ovedapped Instruction in progress 


9 . 


icjnterrupt 


0 » will not iruerrupt when completed 

1 ■* will interrupt when completed 


10 


jump 


0 = current instrucdon is not a jump instrucdon 

1 - current instrucdon is a jump instrucdon 


11 


prefetched 


0 = the next instruction is not prefetched into the 
prefetch buffer 

1 « the next instruction is prefetched into the prefetch 
buffer 



( 515 ) 



1-859 



ic_slat Register Fields 



lllliiifiil 


^;' : ^>:^ N.arn s^^js 


.rf*#^ 5 Descrlp ti oit-^^-^$fe-&£ : % i: " : ; ' 


12 


stream 


currently executing (non-overlapped instruction) 
stream: 

I = stream B 


11 
i \> 


JJf Cl olitTdl U 


stream of instruction prefetched: 

0 = stream A 

1 ~ stream B 


14 


condition 


0 - jump condition not met 

1 - jump met 


16 


a_wait 


0 - A stream is running normally 

1 *= A scream paused because semaphore is taken 


17 


a_async 


0 - No asynchronous transfer in progress in stream A 

1 = Asynchronous transfer in progress in stream A 


18 


a_busy 


0 = A stream not busy 

1 = A stream busy 




a_sem 


A stream register semaphore status: 

0 - semaphore not claimed by anyone or claimed by 
hardware 

1 = semaphore held externally 

The agent recieving the semaphore will see a "0" in 
this bit. 


20 


ajock 


0 *= A stream not locked in 

1 «=A stream locked in 


ill 


a primed 


0 «* A stream "sequence number completed** interrupt 
not primed 

1 =A stream "sequence number completed** interrupt 
primed 


22 


a_paused 


A stream pause 

0 « A stream is not paused due to interrupt or error 

1 = A stream Is paused due to interrupt or error 
rcthe coprocessor! execution by writing 1 


23 


a_oLprimed 


0«A stream "overlapped instruction sequence number 
completed** interrupt not primed 
1 » A stream "overlapped instruction sequence number 
completed** interrupt primed 


24 


b_wait 


0 » B stream is running normally 

1 = B stream paused because semaphore is taken 


25 


b_async 


0 - No asynchronous transfer in progress in stream B 

1 = Asynchronous transfer in progress in stream B 


26 


b_busy 


0 = B stream not busy 

1 = B stream busy j 
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Table 1.14 ic_stat Register Fields 







IlillllS^^ 


27 


b_sem 

- 


B stream register seiuaphore status: 

0 = semaphore not claimed by tiny one or held by tfic 

coprocessor 

1 = semaphore held externally 

The agent recieving the semaphore will see a "0" in 
this bit. 


28 


bjock 


0 = B stream not locked in 

1 = B stream locked in 


29 


b_primed 


0 = B stream "seqience riumber completed" interrupt 
not primed 

1 = B stream "sequence number completed" interrupt 
primed 


30 


b_paused 


B stream pause 

0 - B stream is not paused due to interrupt or error 

1 = B stream is paused due to interrupt or error 
rethe coprocessor! execution by writing 1 


31 


b„ol_primed 


0 = B stream "overlapped instruction sequence number 
completed** interrupt not primed 
] =B stream "overiappedinstnicti on sequence number 
completed" interrupt primed 



n. ic_errjnt 

This register contains active high flags indicating whether an error or interrupt has occurred 
within the Id Each bit can be cleared by writing a *V. 

The reset value of this register is OxOOOOOOCO. 
Table 1.15 ic_err_Jnt Register Fi elds 



^^^^ 






B 


ajlLerr 


A stream illegal imtiuction error 


16 


bjlljerr. 


B stream illegal instructionenor 


24 


a^sncjnt 


A stream "sequence number completed" interrupt 


25 




A stream Instruction completed"* interrupt 


26 


fijsnsjnt 


A stream "sequence number the coproccssoitcd** inter- 
rupt 


27 


ajsjnt 


A stream "instruction the coprocessorted" interrupt 


28 • 


tajsncjnt 


B stream "sequence fwinber compleiea'Virtfernipt 


29 


bjcjnt 


B stream instruaion completed" interrupt 


30 


b_sns_int 


B stream " sequence number the coprocessoried* inter- 
rupt 


31 


b_is_int 


B stream " instruction the coprocessorxed" tmerrupi 
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o. ic_err_int_en 

This register contains the masks use*; 10 enable :r.c various errors or imcrrupis and has 
reset value of 0x00000000. 

Table 1.16 ic_err_int_en Register Fields 









8 


a_tfl_err_en 


A sucam illegal instruction error enable 


16 


bJILcrL.cn 


B stream illegal instruction error enable 


24 


a_snc_jnt_en 


A stream "sequence number completed" interrupt enable 


25 


a_ic_inLen 


A stream "instruction completed" interrupt enable 


26 


a_jsns_int 


A stream "sequence number the coprocessortcd" inter- 
rupt enable 


27 


a_jsjnt 


A stream "instruction the copKxrssunctT Interrupt ena- 
ble 


28 


b_snqjnt 


B stream ''sequence number completed" interrupt enable 


29 


b_fcjnt 


B stream "instruction completed" interrupt enable 


30 


b_sns_int 


B srream " sequence number the coprocessorted" inter- 
nipt enable 


31 


b_fejnt 


B stream ** instruction the coprocessoned* interrupt ena- 
ble 



P- icjpa 

This register holds the most significant 30 bits of the next virtual address to be used to fetch 
an instruction for the A stream. The least two sigrificant bars are assumed to be zero as 
instructions must be word aligned. 

The reset value of this register is 0x00000000 
Table 1.17 icjpa Register Fields 





ipa[3li] | A sntara^risrj^ 


[3130} 





q. icjda 

This register holds the "to do" value for the A stream. This is a 32 bit (wrapping) sequence 
number up to which valid instructions exist. 

The reset value of this register is 0x00000000 
Table 1 .18 icjda Register Fields 





tda[31:0] ^| A stream do" 


[31:0| 





r. icjna 

This register holds the "finished" value for the A stream. This is a 32 bit (wrapping) 
sequence number indicating the last completed instruction. 
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l he roset v;itue ofihis register is 0*0C-'" v XX)0. 
Table 1.19 icjna Register Fields 







f31«] | fna[31:0] 


A stream u fr rushed" value 



s. ic jnta 



This register holds ihe "interrupt" sequence number for the A stream. Tliis is a 32 bit Onn- 
pm*) sequenrc number at which 10 "interrupt at" if the mechanism i 5 cnaWcd and primed 
The reset value of this register is OxOGOOQOOO 



Table 1 .20 icjnta Register Fields 




t icjoa 

This register holds the 32 bit (wrapping) sequence number of the last overlapped instruction 
to be executed on the A stream. 

The reset value of this register os 0x00000000 
Table 1 .21 icjoa Register fields 





HUH 




[31:0] 


loa{31:0] 


A sticam "last overlapped instruction sequence'* 
value 



icjpb 

This register holds ihe most sipiifjcam 30 bits of the next vim»l address io be used io fetch 
an instruction for the B stream. The least two significant bits are assumed to be zero as 
instructions must be word aligned. 

The reset value of this register is 0x00000000 
Table 1.22 Icjpb Register Fields 













1 fpb[3t:2] | B stream instruction pointer 







v. ic tdb 



This itginer hotds the -to do" value for the B stream. This is a 32 bit <wn©piito) sanience 
number up to which valid instructions exist. 

The reset value or this register is Oioooooooo 
Table 1 .23 ic Jdb Register Fields 
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w. icjnb 

™s roister holds the "finished" value for the B stream. This is a 32 bit (wrapping 
sequence numter indicating the last completed instruction. 

The reset value of tltis register is OxCX)0ODOO0. 



Table 1 .24 icjnb Register Fields 









[31:0] 


fnb[3l:0] 


B stream "finished" value 



icjntb 

This agister holds the "interrupt" sequence number for the B stream. This is a 32 bit (wrap 
pmg) sequence number at which to "interrupt at" if the mechanism is enabled and primed 
The reset value of this register is 0x00000000 
Table 1.25 icjntb Register Fields 







' fJ sn-eam ime^^ ^ " * '"' 


(31:0] 


intb[31;0] 





y. ic_lob 

This regisier holds the 32 bit (wrapping) sequence number of the last overlapped instruction 
to be executed on the B stream. 

The reset value of this register is 0x00000000 ' 
Table 1.26 icjob Register tlelds 











lobpno] 


B stream "last overlapped instruction sequence*" value 



ic_sema 

This register is an alias for the lc_stat register with the side effect that a read of this register 
is a request for the A stream register semaphore. 

aa. icjsernb 

This register is an alias for the Ic.stat register with the side effect that a read of this recister 
is a request for the B stream register semaphore. 
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Input Interface Switch Registers 
ab. iiscfg 

Table 1.27 iis_cfg Register Fields 







:■ *r "A^w&im 1 tlon « : « ^ ;: -^&fY- 


(31:14] 


| Reserved 


[13:12] 


po_p 


priority of PO (0 = lowest, 2 highest) 


[ll:IOJ 


dcc__p 


priority of DCC (0 = lowest, 2 « highest) 


[9:8] 




priority of 1C (0 - lowest, 2 - hiehest) 


[7:4] 




Reserved 


PJ 


S 


0 - not in self test mode 

1 = self test mode 


[2] 


B 


0 *= not bypassed 
1 « bypassed 


[13 


E 


0« disabled 
1 - enabled 



ac. ils_stat 



Table 1 .28 iis_stat Register Fields 







"iteerved ^ ^ ' ^ ^ ^ -^-^ 


[31:25] 






[24] 


Lpiefctch_po 


Mate of i_pref ex.ctx_po signal 


[23] 


ij»efctch_da 


state of i_pref etch_dce signal 


[22] 


i_prcfetchjc 


state of i_prefetcruic signal 


[21:20J 


l™c_POjpri 


rotating priority of POarbitrating for the LMC 


[19:18] 


lmc.dccjpri 


rotating priority of DCC arbitrating for the LMC 


[17:16] 


lrncjcjpn 


rotating priority of IC arbitrating for the LMC 


[15:14] 


cie_poJprj 


rotating priority of POarbitrating for the HC 


[13:12] 


dcdccjpri 


rotating priority of E?CC arbitrating for the EIC 


[11:10] 


eicjcjpri 


routing priority of IC arbitrating for the HC 


9 


Imc^^ncpo 


I - PO is granted control of LMC 


8 ' 


Imc^nt^dcc 


1 - DCC is granted control of LMC 


7 


lrric_gnUc 


1 « IC is granted control of LMC 


6 


tic_gnt_po 


l « PO is granted control of EIC 


5 


eic_£nt_dcc 


1 « DCC is granted control of EIC 


4 


eic_gra_ic 


1 - IC is granted control of EIC 


3 


valid_req_po 


1 - legal request made to IIS by PO 


2 


vaJid_rcq_dcc 


1 - legal request made to IIS by DCC 
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Table 1.28 iis_stat Register Fields 









i 


val id_req_ic 


1 = legal request made to IIS by 1C 


0 


Lsource 


state of i_ source sienal 


ad. iis_err_int 
»s_eirjnt Regi 


ster Fields 


Mm 






[31:16] 


interrupt 


Interrupt condition 


[15:0J 


■ error 


error condition : 

bit 3 « IBus request made to IIS while disabled 
bit 2 = read request from illegal the coprocessor 
location 

from Pixel Organiser 

bit 1 = rc2d request from illegal the coprocessor 
location 

from Dat2 Cache Controller 

bit 0 - read request from illegal the coprocessor 

location from Instruction Controller 



ae. Us err ml en 



Table 1 .30 iis jarrjn t jen Reg ister Fields 









[31:16] 


int_mask 


i nterrupt mask 


115:0] 


errjnask 


error mask: 

bit 3 « IBus request made to IIS while disabled 
bit 2 - read request from illegal the coprocessor 
location 

from Pixel Organiser 

bit 1 = read request from illegal the coprocessor 
location 

from Data Cache Controller 

bit 0 « read request from illegal the coprocessor 
location 

from Instruction Controller 
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Table 1.31 iis_ic_addr Register Fields 











ic_aaar 


j address for data currently requested by IC * 



ag. Hs_dcc_addr 



Table 1.32 iis_dcc_adcir Register Fields 



mmmm 




'mmmmmmmmmmmmm 


[31:0] 


dcc_addr 


Address for data currendy requested by DCC 



ah. iis_po_addr 



Table 1 .33 iis_po_addr Register Fields 







^dr^for fata c!i^ ^ 


Pl:0] 


po_addr 





ai. ils burst 



Table 1.34 iisjburst Register Fields 









[13:0) 




Reserved 


[21:16] 


po_burst 


Burst length from PO 


[13:8] 


. dcc_burst 


Bum length from DCC 


[5:0] 


ic_burst 


Burst length from IC 



a], ns_base_addr 



Table 1 .35 ils Jjase_addr Register Relds 









(31:12] 


the 

copnoce$sor_base 
[19:0] 


Base address of the coprocessor memory object 
in memory map. the coprocessor memory object 
is aligned on a page boundary. 


[11:0] 




Reserved 
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ak. iisjest 
Table 1.36 iisjest Register Fields 









Pl:4] 




Reserved 


[3:0] 


test_mode[3:0) 


0 = IC -> EIC, PO -> LMC 
0 = IC •> EIC, DCC -> LMC 
0 = PO -> EIC, IC LMC 
0 ~ PO -> EIC, DCC -> LMC 
0 = DCC -> HC, IC -> LMC 
0 - DCC -> EIC. PO -> LMC 



External Interlace Controller Registers 



a I. eic_cfg 
Table 1 .37 eic_cig Register Relds 



mmm 






0 


reserved 




i 


enable 


Enables all EIC operation as a master on PCI and 
Generic buses. 


2 


bypass 




^ 


reserved 


Enables the operation of one or more of the EIC 
bypass modes. 


5:4 


pcLarb 


PCI Bus Arbitration: 

00 « Fair arbitration 

01 - IB us has priority 
10 = RBus has priority 
Us Fair arbitration 


6 


pcLibus_niiiUen 




7 


pcLrbus_minJcn 




S 


pa_mri_nirni_en 


Enables use of PCI memory read line and memory 
read multiple modes. 


10:9 


gciuarb 




11 


gen_cllc 


0 « Generic bus clock is 1/4 of elk 

1 - Generic bus dock is 1/2 of elk 


21:12 


mrou_mask 


Mask bits that determine the page size used in the 
MMU. 


29:24 


ha$h_func 


Determine the hash function used for page table 
lookups. 



( 524 ) 



1-8 5 



2M 



Table 1.37 eic_cfg Register Fields 









30 


ha sh_ bypass 


if 1 then bypiss tte ha$h function completely, so 
that the page table index is taken from the bottom 
1 3 bits of the virtual page number (regardl ess of 
P3-;c size). 


31 


mmu_bypass 


Forces the MMU io do null mapping on all 
requests. 


am. eic_stot 

Ncxe: bits in this register can change at any time. 
eic__slat Register Fields 




•mam 




0 


ibus_pci_act 


Indicates an active request from the IBus to the 
PQ bus. 


1 


rbfus_pct_acl 


Indicates an active request from the RBus to the 
PCI bus. 


2 . 


rnmu_act 


Intficates the MMU is in the middle of an opera- 
tioiu 


3 


gen_act 


Indicates an active request to the Generic bus. 


4 


ibus_pcL.gnt 


Ibus granted to PCI bus 


5 


rbus_pci_gnt 


Rbus granted to PCI bus 


6 


mrnu_pci_gnt 


MMU service granted to PCI bus 


31:4 


Reserved 





an. elc^errjnt 

Error and interrupt bits of the eic_errjnl register can only be set by the EIC, and can only 
be reset by software. Normal error and internet bits are reset by writing a 1 to that bit Error 
bits which are copies of PCI configuration register bits mast be cleared by writing to those 
registers - wnting the eic_err_int copy of the bit has no effect. 

Table 1 .39 eivjsnjnx Register Fields 







u 


page_/anli 


Setwhen a page fault error occurs. 


1 


proUauIr 


Set when a protection fault error occurs. 


2 • 


gerucrr 


Set when the Generic bus error pin is asserted for 
at least one cancel k cycle. 


3 


gen^addr_err 


Set when a Generic bus burst request crosses out- 
side the Generic bus memory region. 


4 


rwd_eir 


Request when disabled - set when the EIC 
receives an RBus or IBus request when its enable 
bit is zero. 


5 


target_abon_en- 


A copy of the Received Target Abon bit in the 
PCI configuration space. 
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Table 1.39 eic_err_jnt Register Fields 







0 


rnaster_aboit_crr 


A copy of the Signalled Target Abon bit in tlie 
PCI conficuracion space. 


7 


reserved 




8 


data_paricy_CA p 


A copy of ihe Detected Data Pariry Error bit in Lhe 
PCI configuration space. 


9 


gen_exp 


Set when the gen_exp_l pin is asserted for at 
least one can_clk cyde. 


23:10 


reserved 




24 


genjni 


Set when the gen_int_l pin is asserted for at 
least one gen„cl3c cycle. 


31:25 


Reserved 





ao. eic err Inl en 



Table 1.40 eic_errjnt Register Fields 



WBBBm 






0 


page_fault 


Enables the corresponding error bit to cause che 
assertion of c_err. 


i 


prot^fault 


Enables the corresponding error bit to cause the 
assertion of c_er r . 


2 


gen_err 


Enables the corresponding error bit to cause the 
assertion of c_err. 


3 


gen_addr_err 


Enables the corresponding error bit to cause the 
assertion of c_err. 


4 


rw<Lerr 


Enables tf2 corresponding enor bit to cause the 
assertion of c^err.. . 


5 


iaxget w abort_err 


Enables the corresponding error bit to cause the 
assertion of c_err. 


6 


master_abort_err 


Enables the conesprjnding error bit to cause the 
assertion of c_err. 


7 


reserved 




8 


daia_pariry_exp 


Enables the corresponding error bit to cause the 
assertion of c_exp. 


9 


gen_exp 


Enables the corresponding error bit to cause the 
assertion of c_exp. 


23:10 


reserved 




24 


genjnt 


Enables tfe corresponding error bit to cause the 
assertion of c_ int. 


31:25 


Reserved 
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ap- eic test 



Table 1 .41 eic jest Register Fields 









| 31:0 


Not yci defined 





aq. eic_pob 
Table i .42 eic_pob Register Fields 









0 


gcn_pob0 


Value of the genric bus programmable output pin 
pobO 


1 


gen_pobI 


Value of the *enric bus programmable output pin 
pobl 


ar. eic_high_£ 
eic_duaLaddr 


iddr 

Register Fields 








31:0 


lii£h_addr 


orrset of 4GB region in host memory in which the 
coprocessor data structures exist (for dual address 
cycles). 


as. eic__wrtlb_v 
eic_wtlb_v Reg 


Istor Fields 








-120 




the flags associated witn the TLB cmry to be 
written 


11:2 


Rcscnrcd 




31:12 


vpn 


Virtual Page number lo be written or invalidated. 
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at. cic_v,iib_p 
Table 1 .45 eic_wllb_p Register Fields 









0 


inv_all 


Causcs the entire TLB to be invalidated. The FJC 
ueais injs on alter performing the TLB operation. 


1 


inv_emry 


Invalidates a TLB emiv with rhe virmai n-»cn» 

a 11.B Will J VTIUl UIC VU I Ud] pd °C 

number specified. The EIC clears this bit after 
pctfunT-ins the TLB operation. 


2 


writejru 


Write* a TLB entry to the LRU location with the 
specified virtual page number, physical page 
number 2nd control bits. The EIC clears this bit 
after performing the TLB operation- 


3 


write 


Write the TLB entry specified by the Entry 
Number field with the specified virtual page 
number, physical page number and flag bits. The 
HC cleais Lhas tft after performing the TLB oper- 
ation 


7:4 


cni_nurn 


Entry n-imber to be written by Write Entry opera- 
tion. 


31:12 


ppn 


Physics! Page number to be written. 



au. eic_rnmii_v 

Note: values in this register can change at any lime, unless the MMU is disabled by a pace 
fault error or an MMU to PCI bus error- 
Table 1.46 eic_mrmi_v Register Fields 







' Cujnrcra Utij \ocs^^ " ™ 


3:0 


mraujru 




4 


mmujtit 


ramujiit is set when that last TLB access resulted 
in a hi without any page tabic access 


5 


mmu_retry_hit 


v to rnrnu_retry Jm is set when that last TLB access 
resulted in a hit after a page tabic access 


11:0 


Reserved 




31:12 


mmujvpn 


The most recent virtual page number sent to the 
MMU for translation. 



av_ - eic_mmu_p 

Note: values in irris register can clian°e at any time, unless the MMU is disabled by a page 
fault error or an MMU to PCI bus error. 

Table 1.47 eic_mmu_p Register Fields 









31:0 


rnmujpta 


The most recent page table physical address 
fachedbythcMMU 
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aw. eic_ip_addr 

Note: values in this rsgisiei can change at any lirse. unless the IBD is disabled by an IBu< t 
Paction ust 

Table 1.48 eicjp_addr Register Fields 









31:0 


libpa 


Physical address of Uic mosi recent IBus access to 
the PCI 3us. 



ax. eic_rp_addr 

Note: values in this register can charge at an* time, unless the RBR is disabled by an RBus 
to PCI error. 

Table 1 ,49 eic_rp_addr Register Fields 




31:0 



Irbpa 



Physical address of the most recent RBus access 
to the PCI Bus. 



ay. • eic_jg_addr 

Note: values in tliis register can chance ai any time, unless the GBC is disabled by a Generic 
Bus error. 

Table 1.50 e(cjg_addr Register Fields ' 









31:0 


libga 


Address of the most recent IBus access to the 
Generic Bus. 



az. eic_rg_addr 

Note: values in this register can chance at any time, unless the GBC is disabled by a Generic 
Bus error. 

Table 1,51 eic_rgL3ddr Register Fields 









31:0 


Irbga 


Address of the most recent RBus access to the 
Generic Bus. 



PClbus Configuration Space Alias 

The PClbus Configuration Space which consists of IS words is aliased to the registers at 
addresses QxCO to OxCF. 
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Local Memory Controller Registers 

ba. Imi_cfg 
1.52 lmi_cfg Register Fields 









[31:24] 


refjnterval 


Refresh interval in multiples of 4-clk periods 


[23:22] 


ro_prio 


RO access priority (3 = highest) 


(21:20] 


pic_prio 


PIC access priority (3 = highest) 


[19:i&] 


ic^prio 


IC access priority (3 - highest) 


[17:16] 


ns_prio 


JIS access priority (3 « highest) 


[15:13] 


rearbjnterval 


Re- arbitration interval in words (2 A n words) 


[12] 


rnem_enabJe 


Memory enable (1 = memory installed) 


[11] 


banks 


Number of DRAM banks connected {0-1 , 1 =2) 


[10] 


iis_flush 


Flush the US prefetch data (self clearing) 




iis_prefetch 


OS prefetching enable 


[8] 


refrcshjen 


Enable auto (GBR) refreshing of DRAM 


m 


seamless 


Enable seamless SDRAM page faulting 


[61 




reserved 


[5:4] 


bypass^modc 


Bypass mode select: 

00-Rbus 

01 = Abus write 

10 = Abus read 

ll=Ibus 


[3J 


S 


Self test mode 


[2] 


B 


Bypass enable 


[1] 


E 


Enable module 


[0) 




Reserved 



This register contains many configuration and control bits used to define LMC operation 
mode and parameters. Bits thai refer specifically to SDRAM operation have no effect when 
the 5drara_l pin is high. 

The register has a reset value of 0x20000100 setting the refresh interval to 3.2 us at a elkin 
frequency of 80 MHz. All special modes and features are disabled at power up and all access 
priorities are equal and set to zero. Refreshing is enabled at reset, but the rest of the module 
is disabled (E=0> Refreshing is not affected by the E bit. 
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bt>. Imi stat 
Table 1.53 lmi_stat Register Fields 









[31] 


ro_ca 


RO module cycle active 


f30J 


pic_ca 


PIC module cycle active 


129] 




reserved 


R8] 


iis_ca 


IIS module cycle active 


[27) 


ro_cp 


RO module cycle pendin° 


[26] 


Pic_cp 


PIC module cycle pending 


[25] 




reserved 


[24] 


»s_cp 


IIS module cycle pendi ng ~ 


[23:22] 




Reserved 


121.16] 


stiitcA 


Internal control state A 


[1S:13] 




Reserved 


P3:8] 


stateB 


Internal control state B 


[7J 


rd_active 


Reading DRAM 


[61 


wr_active 


Writing DRAM 


[5] 


rf_active 


Refresh in progress 


W 


rf_pending 


Refresh pending 


[3] 


iis_pre_active 


IIS prefetch active 


[21] 




Reserved 


[0] 


sdram 


State of the sdram_l input pin 



The aaus raster Romans module active and pending bits as well as internal state machine 
state .nfonronoa The state machine is elected at twice the Coos interface rate, so twoSelS 
are necessary to hold the state inTotmation, one for each of the last two 80 MH* dock cycles. 
All tats within this register are read only. 

be. lral_err_int 
Table 1 .54 lmi_err_int Register Fields 









[31:24] 


interrupt 


Interrupt status bits " 


[23:8] 


exception 


Exception status bits 


[7:0] 


error 


Error status bits 



Jte error and imerropt^usregister holcsintem.pt. exception and error condition informa- 
«on. The regmer is read / wnte where a read will return the status information ana wriiin. 



* 
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one 10 a particular bit will cause thai bit lobe reset. Writing a zero to any bit Das nocflcc. on 



thai bit- 
Table 1 .55 Interrupt Bit Definitions 





^^•ji^s^w^ sen piion^'w^*^^^ 


24 


Refresh Interrupt. Occurs once per refresh interval (64 ms typ.) 


25 


Refresh not serviced prior to subsequent refresh request 


Error Bit Defin 


— — — 1 

itions 






0 


Access to DRAM when no DRAM enabled 


i 


Access to DRAM while module disabled 


2 


Rbus Address Error 


3 


Abus Address Error 


4 


Cbus Address Error 


5 


Ibus Address Error 



This register should have a reset value of OiOOOOOOOO indicating that no interrupts or errors 
are asserted. Reserved bits are always zero and will never change state. 

,bd lmi_errjnt_en Register 
Table 1 .57 lmi_errjnt_en Register Fields 



■nan 






(31:24] 


int_mask 


Inteirupm 


[23:8] 


exp_rnask 


Exception mask bits 


[7:0] 


err_mask 


Error mask bits 



The error, exception and interrupt enable register is used to selectively enable and disable 
error, exception and interrupt signals. The register h read / write. This register is used to ena- 
ble on a bit-by-bit basis each of the error, exception and Interrupt signals in the ImLerr Jnt 
register There is a one-to-one correspondence between the bits in this register and the bits in 
the Iml.errjnt register. If a particular bit in the tmi_*rr>t_en register is set hi *h then the 
corresponding bit in the ImLerrJnt register will be enabled, and if it is high, the LMC mod- 
ule error, exception or interrupt output signal, c.err, cexp or c tnt will be asserted If a 
particular Imt^rrjnt^en bit is cleared to zero then the corresponding bit in the ImLerr Int 
registers disabled and cannot cause the assertion of the c.err. c ex p or c Jnt signal Since 
there are no exceptions in the LMC. the ex?_juask bits in this register have no efTect and 
are all reserved. 

This register has a reset value of OxOOOOOOOD which disables all error and interrupt sources 
Unused bits arc always rero and cannot be set high. 



7 
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be. Imr_dcfg 
Table 1.58 tn!_dcfg Register Fields 









[31:29] 


ro\v_bils 


Number of DRAM row address bits (n + 6) 


[28:26] 


rnlnmti hiic 

LUI Ui iUI_Ulii 


Number of DRAM colunui address bits (n + <$) 


f25) 




EDO type DRAM (0 - FP, 1 - GDO) 


[24:22] 


precharee_enable 


PTcchargc enable address bit (n + 5) 


[21:19] 


pn;chargc_baiik 


P:i'.i:ha:«e bank address bit (n + 5) 


[18:17] 


casjo 


Number of CAS low clock cycles (CAS pulse 
width) (1 to 4. 0=1) 


[16] 


cas hi 


wumDeror CAS nigh clock cycles (CAS pre- 
charge)(l to2,0«l) 


U5:i3] 


rasjo 


Minimum RAS low clock cycles (1 co 8, 0 = i ) 


112:11] 


ras_hi 


Minimum RAS high clock cycles (RAS prc- 
charge)(lto4,0«l) 


[10:9] 


ras_cas 


RAS to CAS delay in clock cycles <2 to 6> 0 - *>) 


[8:7] 


casjat 


SDRAM CAS latency in clock cycles: 
0, 1= 1 clock 

2 «= 2 clocks 

3 = 3clocks 


«5:0] 




Reserved 



This configuration register holds all the design parameters that determine the size and con- 
figuration of the DRAM chips used 

This register has a reset value of0x0O07FF80 which sets all umine constraints to their max- 
imum values. *" 

bf. lmi_mode Register 
Table 1 .59 lmi_mode Register Fields 









[31:14) 






(13J 


initialize 


iruualize SDRAMs (program mode register, seif 
clearing) 


[12:0) 


sdram_mode 


SDRAM mode bits (written to SDRAM mode 
register over the address pins by initialize 
SDRAM command) 



This configuration register holds the information that 1$ written into the SDRAM mode res- 
istor as pan of the initialiiation procedure. This register nu, be read or writ.en at any tirne 
and maj be written to the SDRAM by setting the initialize bit. 

This register has a reset value of Qx0037.This Is a useful default value that is required imme- 
diately after precnarging ai power up or after a level I reset. This sets read latency to 3 
clocks and burst length 10 full page with sequential wrap. 



( 533 ) 



#lffl¥l 1 -8 5 9 6 9 



X20 



A:ccr any reset. Ilic initialize bit will be set if ihc .dram 1 pin is low, «> initially program 
the SDRAM mode register. After die mode register write is performed, this bit win be 
cleared to zero automatically. 



Peripheral Interlace Controller Registers 



bg. pic_cfg 



Table 1.60 pic_cfg Register Fields 









0 


reserved 




i 


enable 


Enables all PIC operadoiL 


7. 


bypass 




3 


reserved 




4 


big-endian 


Causes all byte packing/unpacMng to use big- 
endian order, i.e. bits 31:24 Tint, bits 7:0 last 


5 


video mode 


0 ~ Video Output ~~ 

1 - Video input 


7:6 


reserved 




8 


gare_f>ut_dock 


Used in Video OuiDUt mode Cbhsm the rmrni* 
dock lube gated with vclK_en_in_L. 


9 


mv._o\JUClock 


Used in Video Output mode. Causes the output 
clock to be inverted with respect to vci k_in. 


10 


use_defaultjjata 


Used in Video Output mode. If this bit is seu the ~~ 
value in the dejauK_data register bit is driven 
onto the output data pins for cycles when data is 
iilh. vdJiu. uuerwise, me next valid data byte is 
driven if available. 


11 


defauJi_data 


Used in Video Output nxxle. The default data 
used if use_def ault_data is enabled. 


12 


data_cn_source 


Used in Video Output mode. When U 
vdat.a_en_out_l is asserted when 
vdat*_«n_in_l is sampled asserted. Other- 
wise, vdana_en_ou t_l is asserted when 
vclk_cn_irj_I is sampled asserted. 


13 


deLdata 


Used in Video Output mode. Causes the output 
data to be delayed one cycle with respect to 


14 


gate_wirh_rdy 


Used in Video Output mode. Causes the output 
clock to be gated wiih vrdy_l. 


15 


reserved 




16 


forwaid_actjdts 


Disables the assertion of c_act ive due to data 
in the output FIFO. 


17 


rcvcrsc_act_dis 


Disables the assertion of c_active due to data 
In the input FIFO. 


18 


rorward_prioriiy 


Relative priority or forward transfers when in a 
bidirectinml mode. 




( 534 ) 



ttHVl 1-8 5 



221 



.60 pic^cfg Register Fields 









19 


reversc_priority 


Relative priority of reverse transfers when in a 

W directions) mode. 


20 


f orwardj nvcn_data 


Inven ouipm data jignafs 


21 


revcrse_inven_ciaca 


Invert input data signals 


n 


forward Jnvcri_.cojitrol 


Invert outpu[ control s; -rials 


23 


reversej nvert^comroi 


Invert ouipui control signals 


30:24 


cbus^timer 


Maximum rime for which a CDus cycle io the 
input or output data stream can be sialled before, 
signalling an error. Specified in multiples of 8 
elk cycles. 


31 


cbus_timer_disable 


Disables the operation of lie CBus rimer. 



bh. picjstat 



61 prc_stat Register Fields 



BH 




Set wtoi^^us trartfa^^ or^n 


0 


ahus_act 


progress. 


1 


rt>us__act 


Set when an RBus transaction is pending or in 
progress. 


2 


reverse_act 


Set a when a reverse data transfer is active. 


3 


cent_c(g 


Copy of the external cent cfg pin. 


4 


forward_full 


Set when the PICTs output FIFO is full. 


5 


forward_subword 


Set when the PIC's output FIFO lias space for 
more than rero but less than four bytes. 


6 


forwanLempty 


Set when the PIC's output FIFO is empty. 


7 


reserved 




8 


reverse_fu!l 


,Sct when the PICs Input FIFO it full. 


9 


tevciscjunpgr 


Set when the Pl(?s input FIFO is empty. 


to 


rcverse.subword 


Set when the PIC's input FIFO contains more 
lhan zero but less than four bytes. 


15:11 


reserved • 




16. 


pic_plh 


Tbt state of the R ic^pih pin. 


17 


ptc^ackj 


Tne state of the pi e_ack_i pin. 


18 


pkjbusy 


The suae of the pic_busy pin, 


19 


pic_perror 


The state of the pic perrorpin. 


20 


picselea 


The State of the pic_seiect pin. 


21 


pic.faultj 


The state of the pic fault 1 pin. 


31:24 


pie.datajn 


The state of the pic_data_in signal. 
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bL picerrjnt 

Enor and imcmjpi bits of the pic_err im register can only be sei by the PIC. and 
DC reset by software. Each bit is reset by writing a 1 to that bit. 

Table 1.62 pic_err_Jnt Register Fields 









0 


iorwand_err 


Forward transfer error Set if the TIC receives out- 
put data tn transfer when configured for an input - 
only mode. 


1 


rwd_err 


Rcqucsiwtien disahted error. Set if the PIC is dis- 
abled when it receives request for an RBus trans- 

•3CTJOTL Or 2flV rt^OI \J PF-ini liof ^rl nn/i ration fAD..» 

uluwi « vi any n-.jji*«/i-iiu(i<nxu upr.rauon ^/Vdus 
transfer, Centronics command, etc.). 


2 


nmeouL_err 


Peripheral timeout error. 


8 


ic verse exti 


R€V£i$C transfer error. Set if software sttempts to 
the coptocessort a reverse transfer when the PIC 
is in an output only mode. 


9 


cbus_exp 


Set when a CBus cycle to the input or output data 
stream times out When this bit is set* further 
CBus cjcles to the input or output data streams 
are treated as reserved. 


24 


rev__comp_int 


Set when the byte-count limit has been reached in 
a reverse transfer. 


25 


rev_datajnt 


Set when data is available from a reverse transfer 
and there is no active bus transaction to indicate 
its destination. 


26 




Set when abus transfer completes 


27 


tirnerjra 


Set when timer 0 or 1 expires 


28 


tomp_faul[Jni 


Set in Centronics compatible mode when the 
pic_fauttj signal is asserted. 


29 


conip_jcnor_lril 


Set In Centronics compatible mode when the 
ptc_penor signal is asserted. 


30 


eq>^rcv_rccLint 


Set in Centronics ECP mode when the peripheral 
Indicates thai it has reverse data available. 


31 


nOL_peri phcral_int 


No peripheral interrupt 


bj. pjcjwrjtot 
pic_errjnl_en 


.en 

Register Fields 




^^^^^^^^ 




0 


forward_cir 


Enables the corresponding interrupt bit to cause 
the assertion of c^in t. 


1 


rwd_cir 


Enables the corresponding imenupi bit to cause 
the assertion of c:__in t. 


2 


ujneoutjerr 


Enables 1 he corresponding interrupt bit to cause 
the assertion of c_int. 
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lable 1.63 pic_errjnt_en Register Fields 









8 


reverse__exp 


Enahles the corresponding ituerrupt bit to cause 

the fl^SP rtirin nf r* i r\ — 


9 


cbus_e«tp 


Enables Lhe corresponding interrupt bit to cause 

the SSfirti of C Mil" 


24 


re v_cornp_im 


Enables the corresponding intcrrup: bit to cause 
uc flxseruon oi c_in t. 


25 


rev_data_int 


Enables the corresponding interrupt bit to c:v<r 
thf! ass.Tnon of c_itiL. 


26 


ahus^omp^int 


Enables the corresponding inccrrupi bit to cause 
the assertion of c int.. 


71 


tirnerjnt 


Enables the corresponding interrupt tit u> cause 
die assertion of c_in t. 


28 


comp_faulUnt 


Enables the corresponding interrupt bit to cause 
the assertion of c_int. 


29 


comp^perrorjnt 


Enables the coircspanding interrupt bit to cause 
the assertion of c_Ln t . 


30 


ecp_rev_req_jnt 


Enables the corresponding interrupt hit to cause 
the assertion of c_.in t. 


31 


nqj)eiiplicraljni 


Enables the corresponding interrupt bit to come 
the assertion oF e_ int. 


bk. pic_abus_ 
pic_abus_c*9 fi 


cfg 

teglster Fields 








23:0 


ab_count 


Number of bytes remaining to transfer to/from the 
LMC Set initially by software, and decremented 
by the PIC as each byte is transferred. 


27:24 


ab_byte_en 


Byte lanes to be used for ABus transfer 


28 


ab_iype 


1 = Transfer is a read (from peripheral pot) 
0 - Transfer is a write (to peripheral port) 


29 


ab_the coproces- 
son 


the coprocessor the programmed ABus transfer. 
This bit is cleared by the PIC when the ABus 
transfer hasljecn completed. If.dcartd by soft- 
«*nt, the ABus transfer Is terminated. 


31:30 


reserved 
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bL pic_abus_addr 
1.65 pic_abus_oddr Register Fields 









3i:0 


ab_addr 


The byie address ai which the next ABus iransac- 
lion should be the coprocessorted. 



bm. pic_cent..clg 

The plecem_clg register contain* read/write control signals and read-only status a°nal 
conrrol all aspect* o: the interface when Centronics mode is enabled. 



1 .66 pic_cent_cfg Register Fields 









2:0 


cen^cmd 


Centronics Command: 

wu = Aeiuin to Compatible 

001 ^Request Nibble 

010 » Transfer Device ID using NibbJe 

011-100 = no-op 

101 = Request ECP 

1 10 * Transfer Device ID using ECP 

HI = Abort 


3 


ccn.exe 


the coprocessorts execute of programmed Centro- 
nics command. This bit is reset when the Centron- 
ics command completes. 


4 


cen^sts 


This read-only bit shows the completion status of 
the previous Centronics command. A I indicates 
that the command was successful, a 0 that it was 
unsuccessful. 


7:5 


ccn_mode 


Read-only bits showing current Centronics mode 
and operation based cm the last negotiation: 

000 = Compatible 

001 = Compatible with Kibble enabled 
010= Transferring Device ID using Nibble 
Oil -Nibble Mode 

1 00 = reserved 

101 m ECP forward mode 

110 = Transferring Device ID using ECP 

111 - ECP reverse mode 


8 ■ 


cen_dircct 


Bypasses the Centronics control logic and allows 
the software to directly control tie output bats by 
writing and reading this register. 


9 


cenjiostjtcjen 


Enables host recovery mechanism in ECP mode. 


15:10 


reserved 




23:16 


cenjim 


Timer value, in cix cycles, for all Centronics 
setup, hold and pulse limes. 


31:24 


ccn_per_tim 


Timer value, in lots of 64k cl Jc cycles, when 
waiting for a Centronics peripheral response. 
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bn. pic_cent_dir 
Table 1 .67 pic_cent_dir Register Fields 



.i: i: .f^!d{bi]|^:! 






o 


pic_d3t3_oe 


Direct control bit for the pic_data_oe sisnal. 


I 


pic_strobe_l 


Direct control bit for the pic_strobe 1 sisnal. 


2 


pic_autofd_l 


Direct control bit for rhe pic_autofd 1 sisnal. 


3 


pic_iniij 


Dipt- ronrrol bit for the pic_init_l sisnal. 


4 


pic_sclecunj 


Direct control bit for the pi c_s elect in 1 sig- 
nal. 


5 


pic_buf_en_l 


Direct control bit for the pic_buf_en_l signal. 


6 


pic_buf_dir 


Direct control bit for the pic_buf_dir signal. 


7 


reserved 




15:8 


pic_data_out 


Direct control bit for the pic_data_ou t signal. 


31:16 


reserved 





bo. pic_reverse_cfg 



Table 1.68 pic_reverse_cfg Register Fields 





3 rn. 6^^^^^-- 




23:0 


rev_count 


Number of bytes remaining to transfer from the 
external peripheral. Set initially by software, and 
decremented by the PIC as each byte is trans- 
ferred. 


24 


rev_use_count 


Causes the reverse transfer to be terminated after 
the specified number of bytes have been trans- 
ferred. 


25 


revjhc coproces- 
sort 


the coprocessors the programmed reverse trans- 
fer. If the rev_use_count bit is set, this bit Is 
cleared by the PIC when the reverse transfer has 
been completed. The software can clear this bit to 
stop the reverse transfer at any time. 


31:26 


reserved 





bp. pic_timerO 



Table 1.69 pic_timerO Register Fields 









31:0 


limerO 


Timer value for first data timeout, in units of one 
elk. 
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bq. pic_timer1 
Table 1.7D picjirnerl Register Fields 







31:0 


timer I 


Timer value for data-to-dau timeout, in units of 
one elk. 



Data Cache Controller Registers 



br. dcc_cfg1 
Table 1.71 dccjcfgl Register Fields 





OKlisabled ^ • 


1 


E 


l=enabled 


2 


B 


G=not bypassed 
l=bypassed 



bs. dcc_cfg2 



Table 1.72 dcc_cfg2 Register Fields 







0 


Autoinv 


0=not in auto-invalidate mode (value upon reset) 

I =auto-invaiidatc cache at the end of each instruc- 
tion 


I 


Autofill 


O=not in autevfdi mode (value upon reset) 
1 ^auto-fill cache the coprocessorting at the 
address specified in dcc_addr. Cache is validated 
at the end of the operation. This bit is self-clear- 
ing. 


2 


Lock 


0=datais written to cache RAM after data fetch 
(value upon reset) 

l=data is not written to cache RAM after data 
fetch 


3 

■ 


alwaysjiit 


0=Tag memory and valid bits are checked (value 
upon reset) 

l«Data is assumed to be always valid and resident 
in cache 


4 j always_miss 


0=Tag memory and valid bits are checked (value 
upon reset) 

1-Data is assumed to be not resident in cache 
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31 VI 



Tabic 1.72 dcc_cfg2 Register Fields 



[10:8) 



II 



mode 



cache_miss_jnst 



current mode of caching operation: 
O=random access- normal mode (vaJuc upon " 
reset) 

1- /PEG Encoding 

2 - slow JPEG Decoding 

3=single channel colour space conversion 
4=mif. : .: channel colour space conversion 
5= matrix multiplication 
6=64 bit mode 

7=^1 invalidate all entries in cache. The DCC will 
reveit to mode 1 when all the •'line valid" bits 
have been cleared 

This field is reset at the end of each instruction. 



current mode of caching operation: 
0=Tag memory and valid bits are checked (value 
upon reset) 

1 =Data is assumed to be not resident in cache 
This bit is reset at the end of each instructioa 



bt. dec &tat 



Table 1.73 dcc_stat Register Fields 











error(3:0] 


error condition: 

bit 0«attempted R/W operation to cache or Tag 
memory with the module disabled. 




state[7:0] 


internal state condition. Details to be determined 



bu. dcc_erMnt 
Table 1.74 dcc^errjnt Register Fields 









0 


error 


attempted R/W operation to cache or Tag mem- 
ory with the module disabled 
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b v. d c c_e r M n t_en 
Table 1.75 dcc_err_int_en Register Fields 









0 error_en 


enable corresponding error 



bw. dec IvO 



Table 1.76 dccJvO Register Fields 









[31:01 . 


lvO(31«] 


valid bits for lines 31 down to 0 



bx. dccjvl 
Table 1.77 dccjvl Register Fields 







mmmmmammmmm 


[31:01 | lyl[3I:0] 


valid bits for lines 63 down to 32 



by. dccjv2 
Table 1 .78 dec Jv2 Register Fields 



131:0] V | lv2[31^ ^ 








valid bits for lines 95 down to 64 



bz. dccjv3 
Table 1 .79 decj v3 Register Fields 







[31:0] 


lv3[31:0] 


valid bits for lines 127 down to 96 
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ca. dec addr 



Table 1.80 dcc_addr Register Fields 









131:0] 


dcc_addr 


This is the base zddrcss register used for the spe- '. 
cial addressing modes. Note that different bits of 
this register are used depending on the opera- 
tional mode. 


cb. dcc_raddrb 




dccjraddrb Register Fields 




'WBwMBm 






[31:0] 


dcc_raddrb 


This is the status of Operand Organiser B*s 
address 



cc. decjaddre 



Table 1.82 dcc_raddrc Register Fields 









[31:0] 


dcc„raddrc 


This is the status of Operand Or/ganiser C*s 
address 



cd. dec test 



Table 1.83 dec Jest Register Fields 









0 


drive_odata 


This XA\ Is Only valid when the tcswnodc bits are 
set to 0, 

0»do not drive Operand Organiser data busses 
(value upon reset) 

l«driye pperand .Organiser data busses 


1 


o_ackb 


this bit is only valid when the lesr mode bks are 
set to 0. 

This bit is onl > useable when the DCG is in its 
test mode and the drive_odata bit is set This bit is 
self-clearing- 

0=*do not assert o_ackb (value upon reset) 
I -assert o_a ckb for one elk cycle 



I 



i 
! 

t Table 1.83 dccjest Register Fields 







..- v :; :■.«.•>■:■:■::.:■» :~-;:-.v -:■>>: *^ f>;**:*»ft**-...--. 


2 


o_ackc 


This bit is only valid when ihe test_mode bits are 
set to 0. 

This bit is only useable when the DCC is in its 
lest mode and the dri ve_odata bit is set. This bit is 
self-clearing. 

0=do not assert o_ackc (value upon reset) 
l=assert o_ackc for one elk cycle 


[5:3] 


iesi_mode 


0=drive odata busses mode(value upon reset) 
l=drive OOB address out to IIS, i„oe~0 
2-drive OOC address out to IIS, i_qe=0 
3=drivc IIS data to o_da tab ( 3 1 : o ) „ i_oe= 1 
4=drive IIS data to o_datac [ 31 ; 0 ], i_oe=l 



Operand Organiser Registers 

Note that there arc two identical Operand Organisers: Operand Organiser B and Operand 
Organiser C The registers for these two Operand Organisers (which are identical) arc 
described here. 



ce. oon_cfg <oob_cfg=0x70, ooc_cfg=0x80) 
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Table 1.84 oon_cfg Register Fields 









(3U9) 




Reserved 


[8] 


operate 


the coprocessor! bit for OO :(self-cleared at end of each 
instruction) 

0«ldle 

1 « operate 


[7:2) 




Reserved 


[1) 


E 


0= disabled 
1 -enabled 


IOJ 




Reserved 
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cf. oon_stal (oob_cfg=0x71, ooc_cfg=:0x81) 
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Table 1.85 oon_stat Register Fields 







[3i:?A] 




Reserved * 


[23:!6J 


suie[7:0] 


internal state condition : 
bit 0 = 00 active 

bit 1 = 00 staled (sequential mode cpjy) 
bit 2 = FIFO empty 
bit 3 « FIFO full 


[15:0] 




Reserved 



eg. oon_err_jn! {oob_err_int=0x72, ooc_errjnt=0x82) 



Table 1.85 oonerr_int Register Fields 







error condiaon : -^-> -^ -V 


[15:0] 


error 


bit 0= OBus request received by OO while disabled 

bit 1- MDP and JPEG Coder requesting data simultaneously 



ch. oon_errjnt_en (oob„err_int_en=0x73 f ooc_errjnt_en^0x83) 



Table 1.87 oon_err_int_en Register Fields 









(15:0) 


error 


error condition : 

bit 0= OBus request received by OO while disabled 

bit 1= MDP and JPEG Coder requesting data simultane- 
ously 
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231 

ci. oo/)_dmr (oob_dmr=0x74, ooc_t!rnr=0x&4) 
Table I .88 oon_dmr Register Fields 









[31:30] 


Is3 


lane swap for byte 3 : 

0 « from byte 0 

1 = from byte 1 

2 = from byte 2 

3 - no swap 


[29:28] 


is2 


lane swap for byte 2 : 

0 = from byte 0 

1 - from byte 1 

2 = no swap 

3 = from byte 3 


[27:26] 


Isl 


lane swap for byre I : 

0 - from byre 0 

1 - no swap 

2 from byte 2 

3 - from byte 3 


[25:24| 

f 


IsO 


lane swap for byte 0 : 

0 = no swap 

1 « from byte 1 

2 - from byte 2 
j = jic/iij rjytc j 


[23:20] 


subenf3-01 

J L* L/V4 VJ 


ouuoLiLuuon enaoies . 

0 « do not perform substitution operation on this byte 

1 ~ use value stored in oon_subrt for this byte 


[19:15] 


replicate 


indicates number of additional data items to generate 


[14:12) 


normalise 


Normalisation factor : 
0« 1 bit per channel 
1^2 bits per channel 

2 = 4 bits per channel 

3 » 8 bits per channel 

4 to 7 « 16 bits per channel 


[11] 




Reserved 


[10:8) 


bo[2;OJ 


Bit offset within byte of bit object 


m 


P 


Externa] Format : 

0 * input objects are unpacked bytes 

1 » input objects are a packed stream 


mi 


if 


Internal Format : 

0 = pixels 

1 = unpacked bytes 

2 - packed bjtcs 
3= other 
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oor._dfr.i- Hegister Fields 







^8^^ 


[4:3] 


cc 


Input Channel Count : 

0 = 4 active channels 

1 = ! active channr.U 

2 = 2 active channels 

3 = 3 active channel s 


(2) 


?, 


0 ~ )fT-: data in oo sej;* i, - 

1 - direct addiessing 


[1:0J 


what 


Addressing Mode : 

0 - bypass 

1 = sequential 

2 - riling 

3 *= const art data 



cj. oon_>ubst (oob_subst=0x75, ooc_subsi=:0x85) 



Table 1.B9 oonsubst Register Fields 



131:01^ ^| ^S^^ 


Immmmmummmm 




value of substitution data 



ck. oon^cdp (oob_cdp=0x76, ooc_cdp=0x86) 



Table 1.90 oonjcdp Register Fields 











I [31:0] 




cxiaem data pointer 



cl. oonjen (oobjen:=0x77 t oocjgn=0x87) 



Table 1 .91 oonjen Register Fields 





leu 






[ [31^J | 


len 


I length of Input operand stream 





cm. oortjsaid (oobjsaid-OxTS, ooc_sald=ox88) 



Table 1 .92 oon_said Register Fields 
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cn. oonjile (oob_tile=0x79, ooc_tile=0x89} 
Table 1 .93 oonjile Register Fields 









(31:16J 


offset[ 1 5:0] 


offset into operand 


(15:01 


tilejen[15:0] 


length -1 , in bytes, of operand for tiling operation 



Pixel Organiser Registers 



co. po_cfg 
Table 1 .94 po_cfg Register Fields 







ifmmmmtmmmmmKm 


[26:24] 


burst[2:0) 


log base 2 of maximum burst length for input data. Data 
bursts are aligned to addresses of this val ue. 


(22:21) 




Number of input channels for JPEG compression : 
0.1= Single channel 

3-4 channels 


(20] 


SS 


0 - no subsampling 

1 = perform subsampling 


(19J 


F 


Filter option for subsampling : 

0 = uo filtering (use every second byte) 

1 « apply filtering (average) 


[18:16] 




Reserved 


(15) 


dsr 


destination for PBus data : 

0 = MDP 

1 - JPEG Coder 


U4;123 


mode(2:0] 


Operating mode for PO : 
0«ldle 

1 « sequential mode 

2 m JPEG Compression 

3 - CSC 

4 to 7= Convoludon/lmage Transformation 


It 1:4) . 




Reserved 


[3] 


S 


0«not in self test mode 
1 - self test mode 


12] 


B 


0 - not bypassed 

1 - bypassed 


[1] 


E 


0- disabled 
\ e enabled 


[0] 




Reserved 
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SO? 

cp. po^slal 
Table 1 .95 po_stat Register Fields 













(Reserved 


126] 


rnuv_miss 


0 = no MUV miss 

1 = MUV miss 


[25] 




0= PO not stalled 

1 = current PO operation stalled 


[24] 


po_active 


0 = PO not active 

1 *= PO active 


[201 


sxn^the copro- 
cessor! 


PO the coprocessort state 


[19} 


ack_rcvd 


ibus ack received 


[18] 


ibus_req,made 


ibus rcq made 


[17] 


rifo.fuii 


FIFO full — 


[16) 


fifo_empty 


FIFO empty 


[15:0] 




Reserved 


cq. po_err_in! 




po_er_int Re< 


gister Fields 












error 


error condition : 

bit 0 - D3us i_ack asserted to PO while disabled 
bit 1 = both PO and RO using MUV RAM simultane- 
ously 

bit 2 « jpeg mode data overflow 



cr. po_errJnt_en 



Table 1.97 po_errjnt_en Register Relds 









[7:0} 


errrnask 


error mask: 

bit 0 = IBus Lack asserted to PO while disabled 

bit I = both PO and RO using MUV RAM simultane- 
ously 

bit 2 = jpeg mode data overflow 
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cs. po_dmr 
Table 1.98 po_<Jmr Register Fields 









[3 1: 30] 


Is3 


lane swap for hytc 3 : 

0 - from byte 0 

1 = from byte 1 

2 = from byte 2 

3 = no swap 


[29:28] 


is2 


lane swap for b)ie 2 : 

0 « from byte 0 

1 = from byte 1 

2 = no swap 

3 «* from byte 3 


[27:26] 


Is! 


lane swap for byte 1 : 

0 ~ from byte 0 

1 — no swap 

2 =* from byte 2 

3 = from byte 3 


(25:24] 


IsO 


lane swap for byte 0 : 

0 - no swap 

1 - from byte 1 

2 c from byte 2 

3 = from byte 3 


[23:20] 


suben[3:0] 


Byte Substitution Enables : 

0 « do not substitute data from po subst register for 
this byte *" 

1 =■ use pojsubst register value for this byte 


[1*15) 


replicate[4:0] 


Replication Count : 

indicates number of additional internal data herns to 
generate 


[14:12} 


nonn_,faaor[2 


Normalisation factor for input data : 
0« 1 bit ^component 
1 " 2 bits per component 

2 » 4 bits per component 

3 =» 8 bits per component 

4 to 7 « 16 bits per component 


[HI 




Reserved 


[10:8] 


to[2:0) 


bat offset within byte; 

Bit addressing is big endian 


17) 


P 


External format : 

0 - input data consists of unpacked bytes 

1 - input consists of a packed stream 
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Table 1 .98 po_dmr Register Fields 







[6-51 




Internal format of data passed to output : 
0- pixels 

1 - unpacked bytes 

2 = packed bytes 

3 = other 


[4:3) 


ccf 1:0] 


Channel Count for input sue am : 
0=4 active channels 

1 = 1 active channel 

2 = 2 active channels 

3 = 3 active channels 


12) 


L 


0 m immediaie operand ("short" format) 

1 *= direct addressing ("long" Format) 


[1:0] 


what[l:0] 


DMU addressing mode : 

0 = bypass 

1 - sequential 

2 = tiling 

3 « constant 



ct. po_subst 



Table 1,99 po_subst Register Fields 



asiiii 




value of su^^onefcua^ ^ ^ '" ^"l '*&'t. M 


31:0 


subst[31^)J 




cu. po__cdp 






po_cdp Register Fields 








31.-0 | 


cdp[31:0] 


address of current data item 


cv * PoJen 






pojen Register Fields 




t«ii$ill 






31:0 


»en(31:0] 


length of the operand 
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cw. po_said 
Table 1.102 po_said Register Fields 







131:0} 


ad 


the coprocessor address or operand data (or immediate " 
ctata value) 



ex. po_idr 
Table 1.103 pojdr Register Relds 









t31:0] 


width[3l:16] 
hd£tu[15:0] 


width - 1. in pixels, of cuntiu image 
height -1 , in lines, of cuncnt image 



C Y- po_muv_vaBd 
Table 1.104 po_muv_valid Register Fields 







m 








valid 


Valid bits for MUV lines 





cz, po__muv 
Table 1.105 po_muv Register Fields 







[31:11] 





Main Data Rath Registers 

da. mdp_cfg 
All bits arc reset to 0. 

Table 1.106 mdp_cfg Register Relds 




0- disabled 
1 = enabled 



0-not bypassed 
I ^bypassed 



0=not in self test mode 
l=self test mode 
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Table 1.106 mdp_cfg Register Fields 







14 


decode 


0= Instruction decoding disabled 

1 = Iisuuuiun decoding enabled: mdp_opl and mdp_op2 

S£t up from nidp_cfg (self clearing) 


15 


operate 


Lht* coprocessors . .elf clearing) 


17 


wordjnput 


0= horizontal interpolation with byte inputs 
1 = horiioraal interpolation with word input 


18 


lang__krni 


fonnai of keinal disci pi or 

0 = short forrnar 

1 = long format 


19 


blend_gen 


0 = blend generation not required 

1 = blend generation required 


31:20 


Instruo 
tion|15;OI 


Instruction major and minor opcodes 



db. mdp_sta1 

All bits are reset to zero 

Table 1.107 mdp^stat Register Fields 



o 


po_ya!id 


status of 'po^valid* signal 


1 


po_final 


status of *po_finaT signal 


2 


po_steII 


status of "po_siair signal 


3 


oob_vaiid 


status of *oob„ack* signal 


4 


oob_req 


status of *oob_jeq' signal 


5 


oob_pendin 

g 


0 = no outstanding requests in OOB Interface 

1 - outstanding lequests in OOB interface 


6 


ooc_yalid 


status of 'ooc_ack' signal 


7 


ooc_req 


status orooc.req' signal 


8 


ooc_peridifi 
9 


0 - no outstanding requests in OOC interface 

1 ■* outstanding requests i n OOC interface 


0 - 


ro^valkJ 


status of *(Q_vattd* signal 


ID 


rojinal 


status of 'lojinal * signal 


11 


ro_stall 


status of 'ro.stall' signal 


12:13 . 


matmul stat 
e[l:0] 


matrix multiplication state 
0-idie 

1 - multiplying the iefi half of matrix 

2 o multiplying the right half of matrix 


14:15 


inLstat(l:0] 


Interpol ation by fixed proportion stales: 

0 « cycle 0 

1 » cycle I 

2 = cycle 2 

3 « cycle 3 
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Table 1.107 mdp^stat Register Fields 







16 


jump 


ramp generator mode (in blend generation) 

0 - step mode 

1 = jump mode 


17:18 


addgen stat 
e( 1:0| 


state of the Ad die ss Generation state machine 

0 = idle 

1 - fetcliing operand from PO 

2 - generating address 


19:20 


ramp_state 


state of rampoeneration state machine 
O-idle 

1 = determine mode of operation 

2 = generating blend 


dc. mdp_j 

All Wire arc* ri 

■r\ll L/ILa «2fC 1 

mdp_err_ir 


erMnt 
eset to zero. 

it Register Fields 






erro^ 


7:0 


error[7:0J 


not the coprooessoned 

errortl] = receive data from OOB when disabled or MDP 
is not the coprocessoned 

error[2) - receive data from OOC when disabled or MDP 
i s not the coprocessoned 


24:8 


excep- 
tionI15:0] 


exceptionIO] - underflow oroverflow in clamping in 
channel 0 

exceptionfl ] « underflow or overflow in clamping in 
channe l i 

exceptional *= underflow oroverflow in clamping in 
channel 2 

exception^) « underflow oroverflow in damping in 
channel 3 

exception[4] « x co-ordinate underflow (image tiansfor- 
madons and convolutions) 

exceptional = y coordinate underflow (image transfor- 
mations and convolutions) 
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dd. mdp_crr_irtt_en 
All biisare itsa to zero. 
Table 1.109 mdp_crr_jnt_en Register Fields 









7:0 


err mask[7:C 
] 


mask error condition 

0 = masked 

1 - noi masked 

en_ma$k[0J = receive dam from PO when disabled or 
MDP is not the coproees sorted 

errmaskU] = receive data from OOB when disabled cr 
MDP is not the coproccssoned 

err_rnajtk[2] «= receive data from OOC wherTdisabled or 
MDP is not the coprocessoned 


24:8 


exp ma$k(1 
5:0) 

r 


mask exception 

0 = masked 

1 ~ not masked 

exp_mask[0] = underflow or overflow in clamping in 
channelO 

exp_mask(l] = underflow or overflow in clamping in 
channel 1 

«p_mask{2] « itnderflow or overflow in clamping in 
channel 2 s 

cxp_rnaskp) = underflow or overflow in clampins in 
channel 3 ~ , 

cxp_mask[4J - x co-ordinate underflow (image transfor- 
mations and convolutions) 

exp_mask[5) « y co-ordinate underflow (ima^c transfor- 
mations and convolutions) 



de. mdp^test 
AQ bits are reset toO 

Table 1.110 mripjest Register Relds 



-mm 






3.-0 


tesLdataPrOJ 


The source of data to RO un^r te$t mcKfc P ^ 

0-3 -RO daub f nan TO l,2or3 

woid have the same effect) 

4 o RO data Is from oob__dat a[3 1 ,*0] 

5 » RO data Is from oob_data[63:32J 

6 m RO data is from oob_data[95:64) 

7 = RO data Is from oob_da tn[ 1 27:96 1 

8 - RO data is from cxx^data[3 1:0} 

9 « RO data is from ooc_data{63:32J 

10 = RO data is from ooc_data[95:64J 

11 «ROdata is from ooc_data[ 127:96] 
12-15 = reserved 


25:4 


reserved 
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Table 1.1 10 mdp_test Register Fields 



wv.<v:-!-.;.y yyy^ v\v: : :*:5:.»tt 






29:26 


output_delay[ 

3:0] 


the number of clock cycles between input and output - 2 


31:30 


skeletal mod 
e[l:0] 


mode of skeletal MDP 

0 = PO to RO with some delay 

1 = PO -> OOB -> OOC -> RO with some delay 

2 = the least significant byte of the data from PO, OOB 
and OOC are combined to give the result word to RO. 

3 = reserved 



df. mdp_op1 

All bits are reset to zero 

Table 1.111 mdp^opl Register Fields 





^^^^^^^^^ 




2:0 


ppb modeA[2 
:0] 


mode of n^ti-Amcdm block A in R^nx>awsin^ Bloclf 


5:3 


ppb modeB[2 
:0] 


mode of inuln-function block B in Preprocessing Block 


8:6 


pba modeA[2 
:D] 


mode of muld-f unction block A in Stase A Processin° 
Block 


11:9 


pba_modeB[2 
:0J 


mode of multi-function block B in Stase A Processing 
Block 


14:12 


pba modeC(2~ 
:0I 


mode of multi-function block C in Stage A Processing 


17:15 


pbb_modeA[2 
:o] 


mode of multi-function block A in Stage B Processing 
Block 


20:18 


pbb modeB[2 
:0] 


mode of multi-function block B in Stage B Processing 
Block 


2351 


pbb_jnodeC[2 
:0] 


mode of multi-function block C in Stage B Processina 
Block 
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Table 1.111 mdp_op1 Register Fields 









27:24 


iri5t_type[3:0] 


Type of instruction: 

0 = no_op (which implies lhat [to Mf)P does not for that 
instruction) 

1 = GCSC 

2 = icSStveU 

3= Matrix Multiplication 

4 = Convolutions 

5 = Image Tiaiisforniaiicri 

6 = reserved 

/ = Half Toning 

8 = Residual mergin* 

9 = bypass (which implies thai MDP only needs to pass 
data from PO to RO) 

10 = Horizontal interpolation 

11 = Vertical Interpolation 
12-13 = compositing 
14-15 - reserved 


28 


(MM 


Interpolation required in Image iransf on nation 

0 - no interpolation 

1 - interpolation 


29 


it acc 


Accumulation required in linage transformation 

0 ^ no accumulation 

1 = accumulation 


30 


camp__unpre 


Unrprt -multiplication required in Compositing 

0 - no un-pre-mulUplication 

1 -un-prc-multiplieation 


31 


comp_Diend 


Blend generation required in compositing 

0 6= no Wend genera ti on 

1 «= blend generation 



dg. mdp_op2 

All bits aie reset to zero. 



Table 1.112 imfpj>p2 Register Fields 









1:0 




multiplicand of Operand A pixel 

0- 0 

1- 1 

2=opaciiyB 
3 -HjpadtyB . 




3:2 


mul_B[iuj 


multiplicand of Operand B pixcJ 
0=0 

1 =1 

2 « opacityA 








3 = -opacityA 





( 557 ) 



&m¥l 1-85969 



Table 1.112 mdp_op2 Register Fields 









4 


reverse 


0 - do not reverse operand in compositing 
"i - reverse operand in compositing 


C 

O 


addgen_mod 
e 


Address generation mode 

0 = image transformation mode 

1 - convolution mode 


6 


self_cfg 


0 ^ lon£ kernel descriptor 

1 = short kernel descriptor, other parameters are self configured 


7 


reserved 




8 


ag_the 
coprocessort 


address generation the coprocessor! bit. It is cleared when the 
address generation is finished 


9 


bg_the 
coprocessort 


Mend generation the coprocessort bit. It is cleared when the Wend 
generation is finished. 


10 


maMhe 
coprocessort 


matrix multiplication the coprocessoa tti It the coprocessors the 
matrix multiplication state machine inside the MDPII. It is cleared 
by thai state machine whenit sees the po_f inal signal asserted. 
(Only valid for Matrix Multiplication instruction.) 


11 


inMhe 
coprocessort 


interpolation the coprocessort bit. It the coprocessors the interpo- 
lation state machine inside MDPII. Once the coprccessorted, this 
bit is set until the state machine sees po_f inal is asserted, (only 
valid for horizontal and interpolation instruction) 


12 


inl_size 


0 « interpolate with packed bytes 

1 = interpolate with unpacked bytes or pixels 
(only valid for horizontal and vertical interpolation) 


13 


int_4 


0 = interpolate by a factor of 2 

1 - interpolate by a factor of 4 

(only valid for horizontal and vertical interpolation) 


14 


int_verticai 


0 - horizontal interpolation 

1 - vertical interpolation 


15 


locK_step 


0 = the PO, OOB and OOC interfaces are orvrarino irvVrvnrfi-nr n p 
each other 

1 « the PO, OOB and OOC interfaces are locked together, so they 
will only accept data together. 


19:16 


* COCI W*J 




2150 


cw configfl: 
OJ 


Clamp-or- wrapper configuration 

0 « wrapped and no absolute value 

1 = wrapped and absolute value 

2 = clamp (overflow toOxFF, underflow to 0x00), but no absolute 
value 

3 = absolute value and clamp 


22 


* f _en 


Fraction Rounder corifiguration 

0 = disabled (returns 0) 

1 ^enabled 
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Table 1.112 mdp_op2 Register Fields 









24:23 


oob mode[t: 
0) 


Mode of OOB interface operation: 

0 = disabled 

1 = sequential 

2 = random 

3 = blend_generaiion 


26:25 


ooc mode[ 1: 
0] 


Mode of OOC interface operation: 

0 « disabled 

1 = sequential 

2 = GCSC 

3 = pixel 


3027 


trans[3:0) 


In Cora positing operation 

0 = do not subtract offset for this channel 

1 = subtract offset for this channel 

In Colour Space Conversion operation 

0 ^ do not apply translation and clamping to output value on this 
channel 

1 = use translation and clamping on thi s output channel 
In Image Transformation or convolution operations 

0 initialise accumulator to 0 for this channel 

1 - initialise accumulator to nxlp_|>oKOOOO fortius cliannd 


31 




reserved 



dh. mdp_por 

A.H bits are reset to zero. 

Table 1.113 mdp_por Register Fields 









[7:0] 


oflsetO[7:0] 


offset for plus operator on channel 0 


[15:8] 


offset I [7:0] 


offset for plus operator on channel 1 


P3:i6) 


Gffset2[7:0] 


offset for plus operator on channel 2 


pi:24] 


ofTsetSp.-O) v 


offset for plus operator on channel 3 


{31:0} 


o/TsfctpirO) 


effcel for convolutions ind transformations 



di. mdp_bi 

AH bits are reset to zero. The mdp_W register is used for different things in different modes: 



Table 1.114 mdp_bi Register Ftelds (compositing mod a) 



m 






[7:0) 


WenderuJO . 


the coprocessort value of blend on channel 0 


115:81 


blendendl 


the coprocessort value of blend on channel 1 
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Table 1.114 mdp_bi Register Fields (compositing mode) 
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[23:16] 


blcndend2 


the coproccssorx value of blend on channel 2 


[31:24] 


blendend3 


the coprocessor value of blend on channel 3 


> mdp__bi Register Fields (non-compositing mode) 








[31:2] 


ioffsci 


offset into the index table 



dj. mdp_bm 

All bits arc reset to zero. The mdp_>m register is used for different things in different 
modes: 



Table 1.116 mdp_bm Register Fields (compositing mode) 





^^|^t eld^ligg 




(7:0] 


WendendO 


end value of blend on channel 0 


[15:8} 


blendendl 


end value of blend on channel 1 


[23:16] 


blende nd2 


end value of blend on channel 2 


[31:24] 


WerdenrB 


end value of blend on channel 3 



Table 1 .1 1 7 mdp_bm Register Fields (non-compositing mode) 









[3:0] 


rows[3:0] 


number of tows in the matrix. 


rr:4) 


COi5l3:0] 


number of columns in the matrix 


[15:8] 


lcvd[7:0] 


number of levels in halftoning 


[20:16] 


bp[4:0] 


location of binary point 



dk. mdpjen 

All bits are reset to zero. 

Table 1.118 mdpjen Register Fields 









3l:0 


length 


length of blend to be produced 
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JPEG Coder Registers 
dt. jc_cfg 



jc_cfg Register Fields for JPEG instructions 



; : : -p^fafBiis)i. 






[31:28] 


mop 


Major opcode from instruction = 0010 


W) 


D 


0 *=> JPEG compress 

1 = JPEG decompress 


[26] 


M 


0 = single colour channel 
1 » multiple colour channels 


[25) 


A 


0= three channel 
1 - four channel 


[24] 


S 


0 = do not use subsampling regime 

1 - use subsampling regime 


[23] 




reserved 


[22] 


H 


0 « use fast hufrrnan algorithm 

1 = use slow huffman algorithm 


121:16] | 


reserved 


(15} 


0 


0 = JC is not operational 

1 - JC is operational 


[14] 


dec 


0 » disable decoding of instruction 

1 = enable decoding of instruction 


[13:8] | 


reserved 


[7] 


A 


0 - align rethe coprocessor! markers to byte bounda- 
ries 

1 » align rcthc coprocessort markers to word bounda- 
ries 


(6J 


Z 


0 » pad with Is 

1 »pad with Gs 


[5:4] 




reserved 


13] 


T ' \ 


0 « not inselft est mode 
1- In self lest mode 


[2] 


B 


0 m not bypassed 

1 - bypassed 


m 


E 


0= disabled 
1 - enabled 


(0) | 


reserved 
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Table 1.120 jc_clg Register Fields for data coding instructions 









[3l:2B] 


mop 


Major opcode from instruction = 0010 


[27] 


D 


0 = compress 

1 -decompress 


[26] 




reserved 


[25:24] 


difr 


Difference between the number of input bytes and 
the number.of output bytes: 

00=* no difference 

0 1 = one extra output byte than input byte 
10 — 11 « one less input byte than output byte 


[23] 


op 


operation: 
0 = huff man 


[22:19] 




reserved 


f 1 S-l/Cl 
[ 10-lOj 


IDO 


input bit offset 


[151 


O 


0 = JC is not operational 

1 = JC is operational 


[14] 


dec 


0 = disable decoding of instruction 

1 = enable decoding of instruction 


[13:71 






[6J 


2 


0" pad with Is 
1 - pad with Ds 


[5:4] 




reserved 


P] 


T 


0 - not inself test mode 

1 =: in self test mode 


12) 


B 


O =. not bypassed 
1 « bypassed 


m 


E 


0 = disabled 

1 - enabled 


[OJ 




reserved 



Tabie 1.121 Jcjtfg Register Fields for memory copy instructions 



ram 






[31:28] 


mop 


Major opcode from instruction « 1001 


[27] 


D 


0 - general purpose data movement 

1 « local DMA 

This bit should always be set to *0* 


[26) 


. B 


0 « not a bit copy operation 

1 b bit copy operation 
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Tahlr> 1.121 jc_cfg Register Fields (or memory copy instfuctians 









125:24] 


diff 


Difference between Lbe number uf input bytes and 
the number of output bytes: 

00 = no difference 

01 = one extra output by it than input byte 

10 - 1 1 = one fess input byte than output byte 


(231 | 


reserved 


[22:20] ! obo 


oi u pur hii offset 


[19] 




reserved 


[18:16] 


ibo 


input bit offset 


[15) 


0 


0 =JC is not operational 

1 -SC is operational 


[K] 


dec 


0 = disable decoding of instruction 

1 = enable decoding of instruai on 


[13:4] 




reserved 


[31 


T 


0 « not inself test mode 

1 « in self test mode 


[2J 


B 


0» not bypassed 
1 = bypassed 


rn j e 


0- disabled 
1 = enabled 


[0] | 


reserved 



dm. Jcjatat 



Table 1 .1 22 jejetat Register Fields 







[31:8] 


reserved 




£7:0] 


" state 


to be finalised 


dn. Jcjerrjnt 
|c_errjnt Register Fields 




EMU 








Teserved 


[l»l 


huffJIUabte 
s 


illegal huffraan table. More than rtihe hufrman table 
heap misses occured. 


[17) 


huffJlLmpo 

■ s 


illegal marker position 


[16] 


huffJILmar 
ker 


illegal market 



( 563 ) 



4*B8¥l 1 - 8 5 9 6 



^50 

Table 1.123 }c_errjnt Register Fields 







c ^Description^ ; # ^■■t^ t -^ i r- 




coeff ill. AC 


illegal AC coefficient value (-1024) during 


r t ai 


COCil_lll_LJL 


ill— — 1 r\.s"> ■ 

illegal DC value 


fl3] 


cocff_iIl_AC 


illegal AC magnitude category 


[12] 


coeff_i)l_DC 
„ma° 


illegal DC magnitude category 


UU 


cocffJILRS 
■j- 


illegal RST m .count value 


UO] 


coeff_overflo 
w 


data overflow deieaed by coeff coder * 


[9) 


coefTJlljrnp 

OS 


illegal marker position 


(SI 


jpeg^underfl 
ow 


underflow 


[I) 


jpeg^disaUe 
d 


received data while disabled error 


10] 


huff_JD_syn] 

boi 


illegal huffman symbol error 



do. Jc_errjni_en 



Table 1.124 jc_err_int_en Register Fields 









[31:19] 




reserved 


H8J 


s 


illegal huffman table. More than nine huffman table 
heap misses occuied. 


[17] 


hulTJlLmpo 

3 


illegal marker position 


116] 


huffjn_rnar 
ker 


illegal marker 


US] 


cocffJILAC 


illegal AC coefficient value (-1024) during 


[MJ 


coeffJU _PC 


illegal DC value 


[13] 


CoefTJlLAC 


illegal AC magnitude category 


[12] 


coefTJtlLDC 
_mag 


Illegal DC magnitude category 


Ulj 


coeff_ill_RS 
T 


illegal RST m count value 


HO] 


coeff.overflo 

w 


data overflow detected by coeff coder 


[9] 


coefT_ill_mp 

OS 


illegal marker position 


(8) 


jpcg_underfl 
ow 


underflow 
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2>V 

Table 1.124 jc_err_inl_cn Register Fields 





7: Mameii^ 






jpe£_disable 
d 


received data while disabled error 


[0] 


huIF_UI sym 
bo) 


ills gal hull man symbol error 


dp. jc_rsi 
jc_rsi Register F 


iotds 








(15:0] 


rsi 


number of MCU blocks between rethe coprocessoa 
marVers 



d q- jc_decode 



Table 1 .1 26 Jcjtecotie Register Fields 





MM 




[0] 


dct_enable 


enable dct submodule 


[1] 


dct_bypass 


put dct subrriodulc into bypass i nude 


p] 


(Iccforward 


put da inio forward mode 


p) 


qdq__enablc 


cnabic quantizer subniodule 


M 


qd^bypass 


bypass quantizer submodulc 


[5] 


qdq_forward 


put quantiwr into forward mode 


[6) 


qrjqjour 


four channel image 


PJ 


qdq_£ubsmpl 


subsarapled image 


C83 


ccjenable 


enable the coeff coder submodule 


[91 


ccJ?ypass 


bypass coeff coder submodule 


[10] 


ce^forward 


put coeff coder Jmp/orwajd mode . 


111) 




code jpeg compliant stream 


[121 


CC__SUtBlDpl 




[13J 


cc Jburchartnel 




[1«] 


ccjnultichannel 




[15] 




enable huflhan coder submodule. 


n«i 




bypass taiflraancoder 


[17] 


hc_forward 


put huff man codex into forward mode 


[18] 


hc_subsmpl 


subsampled Image 


[19] 


hc^fast 


perform fast liuffman coding 


[20] 


hcjpcg 


perform jpeg compliant hutTman coding 


[21) 


he Jour 


four channel image 
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Table 1.1 2G jc_decode Register Fields 







; .<« * ■ . . ""vV. ■ •.•:■>** •.i^%?$®&)&ii8e^t*2& sea-? 


[22] 


hc_alisn 


align RST m markers on word bound aries 


[231 


hc„zerop£d 


0 = pad using * 1 *s 

1 - pad using 'O's 


[■24] 


hc_merncopy 


perform memory copy operation 








(26] 


qdq_rnulo 




[21] 


hc.muki 




[3 1:22 J 




reserved 



dr. jc_res 



Table 1.127 jc_res Register Folds 







R:0] | res 


Residual value 



ds. jc_table_sel 



Table 1.128 jc_lable_sel Register Fields 









[13:8] 


jc„cabte_seL 
quant 


decoded table selection for quantisation 


15:0J 


jc_tab!e_seL 
huff 


decoded table selection for huffman decoding 



Result Organiser Registers 



dt rojcfg 
Table 1.129 ro_cfg Register Fields 







[31:23] 


reserved 




[22:21] 


chan 


JPEG decompressed output data format : 
0,1** Single-channel 

2 *• 3-channel 

3 = 4-cliannel 


[20] 


upssrnple 


0 = noupsaropling 

1 = upsarnple data from MUV RAM 
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Table 1.129 ro_clg Register Fields 









(19:18] 


reserved 




117] 


use_cut 


O=do not use value in cut register to cut output data, 

l=use value in cut register to cut output data_ 

This bit is reset back toO on completion of an instruction. 


U6] . 


use Jim it 


0=do not use value in limit register to limit output data. 

I=use value in limit register to limit output data. 

Tlus Ml is reset back to 0 on completion of an in^! ruction. 


[15:14] 




Reserved 


[13:12] 


mode( 1:0) 


Current mode of o peraiion: 

0 ~ idle 

1 - sequential 

2 - JPEG decompression 

3 ~ CBus data 


Ul:4) 




Reserved 


[3) 


S 


0 « not in self lest mode 

1 » self test mode 


[2] 


B 


0 = not bypassed 
1 « bypassed 


ID 


E 


0 = disabled 

1 = enabled 


[0] 




Reserved 



du. ro_stat 



Table 1.130 ro_stat Register Fields 









[31:24] 




Reserved 


[23:16] 


state[7:0] 


internal state condition ; 
tatO-RO stalled 
bit 1 «RO active 
bit 2 =* FIFO full 
bit 3 = FIFO empty 


[15:0] 




Reserved 
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dv. ro_err_jnt 
Table 1.131 ro_errjn1 Register Fields 









[7:0] 


error 


error condition : 

bit 0 = request when disabled 

bit 1- both MDP and JPEG Coder aciivc ai same time 

bit 2= illegal address error 

bit 3 = jpeg mode data overflow 


dw. ro_err_int_en 




ro_err_int_en Register Fields 








17:01 


err_mask 


error condition mask - enable corresponding error 



dx. rojdmr 



Table 1.133 ro_drnr Register Fields 









[31:30] 


Is3 


lane swap for byte 3 : 

0 e from byte 0 

1 = from byte I 

2 =• from byte 2 

3 = no swap 


[29:28) 


ls2 


lam swap for byte 2: 
0 = from byte 0 
t = from byte 1 

2 = no swap 

3 - from byte 3 


[27:26] 


lsl 


tare swap for byte 1 : 
OofrombyteO 

1 - no swap 

2 • from byte 2 

3 m rram byte 3 


[25:24] 


IsO 


lane swap for byte 0 : 

0 = no swap 

1 -from byte 1 

2 - from byit 2 

3 = from byte 3 
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Table 1.133 ro_dmr Register Fields 









[23:20] 


suben[3:0] 


Substitution Enables : 

0 = do perform substitution for tiiis byie 

1 - use value stored In ro_subst for this byte 


[i£l6] 


wrmask 


Write Masks : 

0 - write out corresponding byte channel 

1 = do n<y w ri te our c orr>: spon d i n ° bvt - c ! - ;j u id 


[I5| 


cmbs 


Choose most significant bits 
0=choose least significant bits of a byte when per- 
forming denormalisation 

1== choose most significant bits of a byte when per- 
forming denorrnalisation 


[H:12] 


normalise 


DenormaJisation factor : 
0 = J bit data objects 
1=2 bit data objects 

2 = 4 bit data objects 

3 - $ bit data objects 

4 to 7 ** 1 6 bit data objects 


111:8] 




Reserved 


(7) 


P 


External Format : 

0 *» unpacked bytes 

1 = packed stream 


(6:5] 


if 


Internal Format : 

1 « unpacked bytes 

2 «= packed bytes 

3 o other 


[4:3] 


cc 


Channel Count : 

0 <= 4 active channels 

1 = 1 active channels 

2 « 2 active channels 

3 a 3 active channels 






Reserved 



dy. ro_subst 



Table 1.154 ro.subst Register Fields 
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dz. ro_cdp 
Table 1.135 ro_cdp Register Fields 









{31:0] | cdp(3l:Uj 


address of current data item 



ea. ro len 



Table 1.136 rojen Register Fields 











lH 


(31:0J 


| len[31:0) 


Output Byte count 



eb. ro sa 



Table 1.137 ro_so Register Fields 



t* 1:Q ] | sa[3l:0] | the coprocessort address 



ec. ro idr 



Table 1.13$ rojdr Register Fields 







B8HI 






(31:0] 


height [1 5:0) 
width[3l:l6] 


height - l,in lines, of current image 
width - I , in pixels, of current image 



ed. ro vbase 



Table 1 .139 ro_vbase Register Fields 



B^iij " | vbasel3l:12| 






the coprocessor Virmal Address Base 



ee. ro cut 



Table 1 .140 ro_cut Register Fields 







31:0 eui[3J:0] 


output cut offset: throw this many bytes away 



Kims 

k 



ft? 



1 *3* 



:iiij-j>f«'.iMK 
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ef. rojmt 
Table 1.141 rojmt Register Fields 











Imil31:0] 


lmt[3l:Oj 


limit to the number of output bytes j 



PCI Configuration Space Alias 

PCI cenfigu ratio space Is a 25G-by-c block p." v^.^crs ctci.r.cd ia 1X11 spec, which 
allows the host to configure the PCI (ieviee, and to read its status. It is accessed using PCI 
configuration cycles. The register contents are also mirrored into a read-only area of the 
coprocessor's internal memory space, so that they can be read via normal Fa bus memory 
cycles- 

The formal of the configuration space implemented in the EIC is shown in Figure 1. 1 
Figure 1.1 the coprocessor PCI Configuration Space Layout 



31 


16 15 


0 




Device ID 


Vendor ID 


DxOO 


Status 


command 


0X04 


Class Code 


Revision ID 


0*08 


Reserved 


Header Type 


Latency 
Timer 


Cache Line 
Size 


OxOC 


Base Address 


0X10 


Reserved 


0x14-0x28 


Subsystem ID 


Subsystem Vendor ID 


0x2C 


Reserved 


0x30-0x38 


MaxLat 


MinJSnt 


Interrupt Pin 


Interrupt Line 


0x3C 



Reserved registers, and reserved bits of Implemented registers, will return 0 on reads and 
will not be affected by writes. Configuration space addresses in the range 0x40-0xFF are 
also reserved - no vendor specific configuration registers are denned. 

eg. Vendor ID 

This register fc read only. ClSRATs Vendor ID is oxl I AC. 

eh. Device ID 

This register is read only, the coprocessor's Device ID is 0x0001. The Device ID field is split 
up into two 8 bit fields: the most significant 8 bits is a number characteristic of the device 
(OxO for the coprocessor) and the least significant 8 bits represents a version number of [hut 
device (Ox 1 for this version of the coprocessor)* 
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ei. Command Register 

The defini uons of the fields in the Command register arc shown in Tabic 1 .142. ^11 non- 
reserved bits in Lhis register arc readAvritc. After reset, this register is set to OxOOOO. 

Table 1.142 the coprocessor PCI Command Register Fields 









0 


IO Space 


Reserved. 


1 


Memory Space 


Enables the coprocessor to respond to memory 
cycles. 


1 


Bus Master 


Enables the coprocessor to generate cycles on the 
PCI bus. 


3 


Special Cycles 


Reserved. 


4 


Memory Write & 
Invalidate Enable 


Reserved. 


5 


VGA Palette 
Snoop 


Reserved. 


6 


Parity Error Re- 
sponse 


Enables the assertion of perr_l on data parity 
errors, and serr_l on address parity errors (if 
bit 8 is also set). Also enables the setting of the 
Parity Error Detected bit in the Status register. 


7 


Wait cycle control 


Rcscrvcd. 


8 


serr_I* Enable 


Enables the assertion of serr_l. the coprocessor 
will only assert serr_l on address parity errors. 


9 


Fast back- to-back 
Enable 


Reserved.' 


10-15 


Reserved 


Reserved. 



ej. Status Register 

The definitions or the fields in the Status register are shown in Table 1 . 1 43. Reads to this reg- 
ister behave normally. Some bits of this register are read-only. The other bits can be set to 1 
only by the coprocessor, and can be reset to 0 only by the host (except in test modes)- The 
host resets a status bit by writing a 1 to that bit; wriung a 0 to any bit has no effect. After 
reset, this register is set to 0x0280. 

Table 1.143 the coprocessor PCI Status Register Fields 









0-4 


Reserved 


Reserved 


5 


66MHz capable 


Reserved, 


' 6 


User Definable 
Features 


Reserved. 


7 


Fast Back-to- 
Back Capable 


This bit is read-only, and set to L It indicates that 
the coprocessor behaves correctly in a system 
where fast back-to-hack cycles are addressed to 
' different target devices. 
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H 


Data Parity Error 
Delected 


"This bit is set whenever the coprocessor, as a mas- 
ter, detects a data parity error in read data, or 
detects perr_l asserted on a write. It is only se; 
if bit 6 of ihc command ie°isicr is set. When this 
bit is sc( t an interrupt is signalled. 


10-9 


devsel_l tim- 
ing 


These read only bits are set to 01, to indicate that 
ihc coprocessor responds assens its devsel_L 
with rr.edium speed when it is a target. 


11 


Signalled Target 
Abort 


Reserved. 


12 


Received Target 
Abort 


This bit is set whenever the coprocessor, as a mas- 
ter, receives a target abort. It causes the ELC to 
generate an interrupt, and to stop its operation 
until the bit is cleared by software. 


13 


Received Master 
Abort 


This bit is set whenever the coprocessor, as a mas- 
ter, executes a master abort. It causes the EIC to 
generate an interrupt, and to stop its operation 
until the bit is cleared by software. 


14 


Signalled System 
Error 


This bit is set whenever the coprocessor asserts 
serr_l. This will only occur on an address par- 
iry error. 


15 


Detecred Parity 
Error 


This bit is set whenever the coprocessor detects an 
address or data parity error. 



ek. Revision ID 

This is a read-only register, the coprocessor's initial revision ID is 0x01 . 
el. Class Code 

This Is a read-only register, the coprocessor does not fit into any of the defined class codes of 
the PCI SIG, so this register is set to OxFPOOOO. 

em. Cache Une Size 

This Ss a reaoVwrite register that specifies the cache fine size of the system in 32 bit words. It 
determines when the coprocessor will use the Memory Read Line and Memory Read Multi- 
ple commands, the coprocessor supports any value from 0 to 255 in this register. A value of 
zero in this register disables the Memory Read Line and Memory Read Multiple command 
types. It is set to 0x00 on reset 

en. Latency Timer 

This is a read/write register that specifies the largest number of docks the coprocessor can 
use for any PCI transaction, the coprocessor suppons any value in this register front 0 to 
255. It is set to 0x00 on reset. 

eo. Header Type 

This read-only register is set to 0x00. indicating that the coprocessor uses a type 0 layout for 
the configuration space. 

ep. Base Address 

This reaoVwrite register fc used iu locale the coprocessor's internal registers and memories, 
the local memory and the Generic interface in the host's memory map. the coprocessor's 
various resources occupy 64 MBytes (not all locations arc uscdX so only the top 6 bits of this 

. ■ i . , ft - ,. 
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register ax writable. The remaining address bits are all hardwired to zero. The lower four 
bits of this register arc read-only control bits, which arc also hardwired to 0. This indicates 
that die register refers to memory space, that the coprocessor can txr mapped anywhere in the 
32-bii address space of the host, and that the coprocessor resources are not prcfetchable 
whenii is a target. 

eq. Subsystem Vendor ID 

This read-only register allows the host to identify the vendor or a PCI board plugged into the 
system (as opposed to the vendor of the componem implementing the PCI interface on the 
board). The contents of this register are loaded on reset via the serial configuration port on 
the EIC. 

er. Subsystem ID 

The read-only register allows the host to identify a PCI board plug °ed into the system. The 
contents of this reeisterare loaded on reset via the serial conhVu ration port on the EIC. This 
mechanism allows any required information about the board functionality or configuration 
to be encoded external ly and read by the host 

es. Interrupt Line 

This read/write register is used by the system software to record interrupt line routing infor- 
mation so that it is accessible to interrupt service software. It has no effect on any operations 
in the coprocessor. It is sex to 0*00 on reset. 

et Interrupt Pin 

This read only register is hard wired to OxOl , indicating that the coprocessor drives PCI 
interrupt pin inta_l. 

eu. Min_Gnt 

This read only register indicates to the system the coprocessor's desired burst period length, 
in units of 1/4 mictoseconds. The optimum value for this register has not yet been deter- 
mined. 

ev. Max_Lat 

This read only register indicates to the system the coprocessor's desired maximum latency in 
gaining control of the PCI bus after a request, in units of J/4 microseconds. The optimum 
value for this register has not yet been determined. 

1.1.4 Internal Memory Map 

This section details the objects that appear in the per-module data areas within the the 
coprocessor internal rncrnory map. 



Table 1.144 the coprocessor Internal Memory Map 




0x0000- 
OfclFFF 



0*8000- 
Ox803F 



0x8040- 
0x807F 



reserved 



EIC 



reserved 



reserved 



eic_ptp 



reserved 



R/W 



MMU page table pointers 
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Table 1.144 the coprocessor Internal Memory Map 





S^fsSbciaied'S 
^Module 




:> j k/.tiii r: ±: ~ : •: 


<' S<V: *it.-V V: '-. ' .'. * ( > : W : -:V ' " , 


0x8080- 

OxSOFF 


EIC 


eic^actdr 


Read 
Only 


CAM dau wirh vinuaJ and physi- 
cal page numbers interleaved 


0x8100- 
Oxhhhh' 


reserved 


reserved 






0x10000- 
OxlFFFF 


LMC 


reserved 






0x20000- 
Ox2007F 


PIC 


input RAM 


Ryw 


32 word x32 bit RAM in PIC input 
datapath 


0x20080- 
Ox200FF 


PIC 


output RAM 


R/W 


32 word x32 bit RAM in PIC out- 
put datapath 


0X20100- 
0x2FFFF 


PIC 


reserved 






0x30000- 
0x3FFFF 


MISC 


reserved 






0x40000- 
40017 


IC 


prefetch_buf 


read 
only 




0x40018- 
0x4*-H-K 


IC 


reserved 






0x50000 


DCC 


cache RAM 


R/W 


4k byte of 32 bil wide memory 


0x51000 


DCC 


tag RAM 


RAV 


128 x 20 bit tag memory - the Most 
significant 20 bits of the 32 bit 
word are used 


0x52000 


DCC 


dec. odatabO 


R/W 


bits 3 1 :0 of o_datab bus. This 
value is driven on to the bus in the 
test mode. 


0x52004 


DCC 


dcc_odatabl 


RAV 


bits 6332 of o_datab bus. This 
value is driven on to the bus in the 
test mode. 


0x52008 


DCC 


dcc__odatab2 


RAV 


bits 95:64 of o__da tab bus. This 
value is driven on to the bus in the 
test mode. 


OxS200c 


DCC 


dcc„odatab3 


RAV 


bits 127:96 of o_datab bus. This 
value is driven on to the bus in the 
test mode. 


0x52010 


DCC 


dcc.odacacO 


RAV 


bits 31:0 of o_datac bus. This 
value is driven on to the bus in the 
test mode. 


0x52014 


DCC 


dcc_odatacl 


RAV 


bits 6332 of o_datac bus. This 
value is driven on to the bus in the 
test mode. 


0x52018 


DCC 


dcc_odatac2 


R/W 


bits 95:64 of o_datac bus. This 
value is driven on to the bus in the 
test mode. 
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2^ 

the coprocessor Internal Memory Map 









ri 1 * '>»:i> * *" * : . 






0x5201c D qq 


dcc_odatac3 


RAV 


wis 127:96 of o^datac bus. 7*7 
valueisdnvcnon to Hie bus in the 
test mode. 


0x060000- PO 
0x0607 FF 

0x060800 - J pQ 


mu v_ ram 




^UV RAM. The MUV ra^ 

appears in the same format as the 
Interval and Fraction RAM 


Ox06FFFF 


reserved 







0x70000- O0B 
OxlFFFF 

0x80000- OOC 
0x8FFFF | 


reserved 
reserved 




■ — _ 




0x90000- Jc 
Ox900FC 

0x90100- 1 JC 


Quantizer 
Buffer 1 




Uses only ire least significant 15 
bits. The most significant 17 bits 
are reserved. 


0x901 FC J 
0x90200- J JC 


Quantizer 
Buffer 2 




uses only the least significant 15 
bits. The most significant i 7 bits 
are reserved. 


Ox902PC 

OyOfnAn r~- ■■ 


DCTBuifler 




Uses only the least si gnifjcaiu 1 2 
bits. The most significant 20 bits 
arc rpc#*rvArt 

***** !vr3f^l VCLL 


0x9FFFF 
OxOAOOOO- 


JL 


reserved 






OxOAOlFF 
OX0A02O0- 


* MDP 






Lois or internal structures. 


OxQAFFFF 
OxOBOOOO- 


MDP 


reserved 






QxOBFFFF 

OxOOOCXV 
OxCFFFF 

OxDOOOO- 


RO 


reserved 
reserved ~ 






OxDFFFF 
OxBDOOO- 


US 


reserved 






FFFFF 




reserved 
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1-1.5 Memory Word Fields 
a. eic_ptp 



Table 1.145 etc_ptp Memory Word Fields 



-' ;■ ; Fi eld 1 1 ^ : .;. 




a;. ^ ^ ^V: V-^^ J^iP^T { n,i t i o h i- ^; '^/ii^^^r^V. 


J 1:0 


Reserved 




31:12 


pep 


Top 20 bits of ph y si zal : - v lc ad d rt >r c, f t h 1 o= 1 
of a 4kB segment of the page table. 
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4. BRIEF DESCRIPTION OF THE DRAWINGS 

Notwithstanding any other forms which may fall within the scope of the present 
invention, preferred forms of the invention will now be described, by way of example 
only, with reference to the accompanying drawings: 

Fig. 1 illustrates the operation of a raster image co-processor within a host 
computer environment; 

Fig, 2 illustrates the raster image co-processor of Fig. 1 in further detail; 

Fig. 3 illustrates the memory map of the raster image co-processor; 

Fig, 4 shows the relationship between a CPU, instruction queue, instruction 
operands and results in shared memory, and a co-processor; 

Fig. 5 shows the relationship between an instruction generator, memory manager, 
queue manager and co-processor; 

Fig. 6 shows the operation of the graphics co-processor reading instructions for 
execution from the pending instruction queue and placing them on the completed 
instruction queue; 

Fig. 7 shows a fixed length circular buffer implementation of the instruction 
queue, indicating the need to wait when the buffer fills; 

Fig. 8 illustrates to instruction execution streams as utilized by the co-piocessor, 
Fig. 9 illustrates an instruction execution flow chart; 

Fig. 10 illustrates the standard instruction word format utilized by the co- 
processor; 

Fig. 11 illustrates the instruction word fields of a standard instruction; 
Fig. 12 illustrates the data word fields of a standard instruction; 
Fig. 13 illustrates schematically the instruction controller of Fig. 2; 
Fig. 14 illustrates the execution controller of Fig. 13 in more detail; 
Fig. IS illustrates a state transition diagram of the instruction controller; 
Fig. 16 illustrates the instruction decoder of Fig. 13; 
Fig. 17 illustrates the instruction sequencer of Fig. 16 in more detail; 
Fig. 18 illustrates a transition diagram for the ID sequencer of Fig. 16; 
Fig. 19 illustrates schematically the prefetch buffer controller of Fig. 13 in more 
detail; , 
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Fig. 20 illustrates the standard form of register storage and module interaction as 
utilized in the co-processor; 

Fig. 21 illustrates the format of control bus transactions as utilized in the co- 
processor; 

5 Fig. 22 \: ? >e data flow through a porticr = f the co-processor: 

Figs. 23-29 illustrate various examples of data reformatting as utilized m the co- 
processor; 

Figs. 30 and 31 illustrate the format conversions carried out by the co-prooessor; 
Fig. 32 illustrates the process of input data transformation as carried out in the 
10 co-processor; 

Figs. 33-41 illustrate various further data transformations as carried out by the 
co-processor; 

Fig. 42 illustrates various internal to output data transformations carried out by 
the co-processor; 

15 Figs. 43-47 illustrate various further example data transformations carried out by 

the co-processor; 

Fig. 48 illustrates various fields utilized by internal registers to determine what 
data transformations should be carried out; 

Fig. 49 depicts a block diagram of a graphics subsystem that uses data 
20 normalization*; 

Fig. 50 illustrates a circuit diagram of a data normalization apparatus; 

Fig. 51 illustrates the pixel processing carried out for compositing operations; 

Fig. 52 illustrates the instruction word format for compositing operations; 

Fig. 53 illustrates the data word format for wmpc^itirjg operations; 
25 Fig. 54 illustrates the instruction word format for tiling operations; 

Fig. 55 illustrates the operation of a tiling instruction on an image; 

Fig. 56 illustrates the process of utilization of interval and fractional tables to re- 
map color gamuts; 

Fig. 57 illustrates the form of storage of interval and fractional tables within the 
30 MUV buffer of the co-processor; 

Fig. 58 illustrates the process of color conversion utilising interpolation as 
carried out in the co-processor; 
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Fig. 59 illustrates the refinements to the rest of the color conversion process at 
gamut edges as carried out by the co-processor; 

Fig. 60 illustrates the process of color space conversion for one output color as 
implemented in the co-pTOcessor; 

Fig. 61 illustrates the memory storage within a cache of the co-prooessor when 
utilising single color output color space conversion; 

Fig. 62 illustrates the methodology utilized for multiple color space conversion; 

Fig. 63 illustrates the process of address re-mapping for the cache when utilized 
during the process of multiple color space conversion; 

Fig. 64 illustrates the instruction word format for color space conversion 
instructions; 

Fig. 65 illustrates a method of multiple color conversion; 

Fig. 66 and 67 illustrate the formation of MCLTs during the process of JPEG 
conversion as carried out in the co-processor, 

Fig. 68 illustrates the structure of the JPEG coder of the co-processor; 

Fig. 69 illustrates the quantizer portion of Fig. 68 in more detail; 

Fig. 70 illustrates the Huffman coder of Fig. 68 In more detail; 

Figs. 71 and 72 illustrate the Huffman coder and decoder in more detail; 

Figs. 73-75 illustrate the process of cutting and limiting of JPEG data as utilized 
in the co-processor; 

Fig. 76 illustrates the instruction word format for JPEG instructions; 

Fig. 77 shows a block diagram of a typical discrete cosine transform apparatus 
(prior art); 

Fig. 78 illustrates an arithmetic data path of a prior art OCT apparatus; 

Fig. 79 shows a block diagram of a DCT apparatus utilized in the co-processor, 

Fig. 80 depicts a block diagram of the arithmetic circuit of Fig. 79 in more detail; 

Fig. 81 illustrates an arithmetic data path of the DCT apparatus of Fig. 79; 

Fig. 82 presents a representational stream of Huffman-encoded data units 
interleaved with not encoded bit fields, both byte aligned and not, as in JPEG format; 

Fig. 83 illustrates the overall architecture of a Huffman decoder of JPEG data of 
Fig. 84 in more detail; 

Fig. 84 illustrates the overall architecture of the Huffman decoder of JPEG data; 
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Fig. 85 illustrates data processing in the stripper block which removes byte 
aligned not encoded bit fields from the input data. Examples of the coding of tags 
corresponding to the data outputted by the stripper are also shown; 

Fig. 86 shows the organization and the data flow in the data preshifter; 

Fig. 87 shows control logic for the decoder of Fig. 81; 

Fig. 88 shows the organization and the data flow in the marker preshifter; 

Fig. 89 shows a block diagram of a combinatorial unit decoding Huffman 
encoded values in JPEG context; 

Fig. 90 illustrates the concept of a padding zone and a block diagram of the 
decoder of padding bits; 

Fig. 91 shows an example of a format of data outputted by the decoder, the 
format being used in the co-processor; 

Fig. 92 illustrates methodology utilized in image transformation instructions; 

Fig. 93 illustrates the instruction word format for image transformation 
instructions; 

Figs 94 and 95 illustrate the format of an image transformation kernal as utilized 
in the co-processor; 

Fig. 96 illustrates the process of utilising an index table for image 
transformations as utilized in the co-processor. 

Fig. 97 illustrates the data field format for instructions utilising transformations 
and convolutions; 

Fig. 98 illustrates the process of interpretation of the bp field of instruction 

words; 

Fig. 99 illustrates the process of convolution as utilized in the co-processor; 

Fig. 100 illustrates the instruction word format for convolution instructions as 
utilized in the co-processor, 

Fig. 101 illustrates the instruction word format for matrix multiplication as 
utilized in the co-processor; 

Figs 102-105 illustrates the process utilized for hierarchial image manipulation 
as utilized in the co-processor; 

Fig. 106 illustrates the instruction word coding for hierarchial image instructions; 

Fig. 107 illustrates the instruction word coding for flow control instructions as 
illustrated in the co-processor, 
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Fig. 108 illustrates the pixel organizer in more detail; 

Fig. 109 illustrates the operand fetch unit of the pixel organizer in more detail; 

Figs. 110-114 illustrate various storage formats as utilized by the co-processor, 

Fig. 115 illustrates the MUV address generator of the pixel organizer of the co- 
processor in more detail; 

Fig. 116 is a block diagram of a multiple value (MUV) buffer utilized in the co- 
processor; 

Fig. 117 illustrates a structure of the encoder of Fig. 116; 

Fig. 118 illustrates a structure of the decoder of Fig, 116; 

Fig. 119 illustrates a structure of an address generator of Fig. 116 for generating 
read addresses when in JPEG mode (pixel decomposition); 

Fig. 120 illustrates a structure of an address generator of Fig. 116 for generating 
read addresses when in JPEG mode (pixel reconstruction); 

Fig. 121 illustrates an organization of memory modules comprising the storage 
device of Fig 116; 

Fig. 122 illustrates a structure of a circuit that multiplexes read addresses to 
memory modules; 

Fig. 123 illustrates a representation of how lookup table entries are stored in the 
buffer operating in a single lookup table mode; 

Fig. 124 illustrates a representation of how lookup table entries are stored in the 
buffer operating in a multiple lookup table mode; 

Fig. 125 illustrates a representation of how pixels are stored in the buffer 
operating in JPEG mode (pixel decomposition); 

Fig. 126 illustrate a representation of how single coIot data blocks arc retrieved 
from the buffer operating in JPEG mode (pixel reconstruction); 

Fig. 127 illustrates the structure of the result organi2er of the co-processor in 
more detail; 

Fig. 128 illustrates the structure of the operand organizers of the co-processor in 
more detail; 

Fig. 129 is a block diagram of a computer architecture for the main data path unit 
utilized in the co-processor; 

Fig. 130 is a block diagram of a input interface for accepting, storing and 
rearranging input data objects for further processing; 
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Fig. 131 is a block diagram of a image data processor for performing arithmetic 
operations on incoming data objects; 

Fig. 132 is a block diagram of a color channel processor for performing 
arithmetic operations on one channel of the incoming data objects; 

Fig. 133 is a block diagram of a nr-Uifunction block in a color channel processor; 

Fig. 134 illustrates a block diagram for compositing operations; 

Fig. 135 shows an inverse transform of the scanline; 

Fig. 136 shows a block diagram of the steps required to calculate the value ior a 
designation pixel; 

Fig. 137 illustrates a block diagram of the image transformation engine; 

Fig. 138 illustrates the two formats of kernel descriptions; 

Fig. 139 shows the definition and interpretation of a bp field; 

Fig. 140 shows a block diagram of multiplier-adders that perform matrix 
multiplication; 

Fig. 141 illustrates the control, address and data flow of the cache and cache 
controller of the co-processor; 

Fig. 142 illustrates the memory organization of the cache; 

Fig. 143 illustrates the address format for the cache controller of the co- 
processor; 

Fig. 1 44 is a block diagram of a multifunction block in a color channel processor; 

Fig. 145 illustrates the input interface switch of the co-processor in more Fig. 
144 illustrates, a block diagram of the cache and cache controller; 

Fig. 146 illustrates a four-port dynamic local memory controller of the co- 
processor showing the main address and data paths; 

Fig. 147 illustrates a state machine diagram for the controller of Fig. 146; 

Fig. 148 is a pseudo code listing detailing the function of the arbitrator of Fig, 

146; 

Fig. 149 depicts the stiucture of the requester priority bits and the terminology 
used in Fig. 146. 

Fig. 150 illustrates the external interface controller of the co-processor in more 

detail; 

Figs. 151-154 illustrate the process of virtual toVfrom physical address mapping 
as utilized by the co-processor; 
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Fig. 155 illustrates the IBus receiver unit of Fig. 150 in more detail; 

Fig. 156 illustrates the RBus receiver unit of Fig. 2 in more detail; 

Fig. 157 illustrates the memory management unit of Fig. 150 in more detail; 

Fig. 158 illustrates the peripheral interface controller of Fig. 2 in more detail. 
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1. ABSTRACT 

The present invention discloses apparatus for. and a method of. 
rendering image data prior to outputting of the result iig image. A graphics 
co-processor (224) is utilized together with a host CPU (202). the former 
having a plurality of data calculation streams (241. 242. 243) arranged in 
parallel fashion. Only one of the data calculation s treams (241, 242, 243) 
is operated at any one tine. Preferably at least one (242) of the data 
calculation streams is able to be reconfigured. 

2. REPRESENTATIVE DRAWING 
Fig. 2 
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